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METHODS OF DIAGNOSIS OF CANCER, COMPOSITIONS AND 
METHODS OF SCREENING FOR MODULATORS OF CANCER 

FIELD OF THE INVENTION 

5 The invention relates to the identification of nucleic acid and protein expression 

profiles and nucleic acids, products, and antibodies thereto that are involved in cancer and 
other diseases; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of these conditions. The invention further relates to methods for 
identifying and using agents and/or targets that modulate these conditions. 

10 BACKGROUND OF THE INVENTION 

Cancer is a major cause of morbidity in the United States. For example, in 1996, the 
American Cancer Society estimated that 1,359,150 people were diagnosed with a malignant 
neoplasm and 554,740 died from one of these diseases. Cancer is responsible for 23.9 
percent of all American deaths and is exceeded only by heart disease as a cause of mortality 

15 (33 percent). Unfortunately, cancer mortality is increasing and sometime early in this 

century, cancer is expected to become the leading cause of mortality in the United States as 
it already is in Japan. 

Cancers share the charactaristic of disordered control over normal cell division, 
growth, and differentiation. Their initial clinical manifestations are extremely 

20 heterogeneous, with over 70 types of cancer arising in virtually every organ and tissue of 
the body. Moreover, some of those similarly classified cancer types may represent multiple 
different molecular diseases. Unfortunately, some cancers may be virtually asymptomatic 
until late in the disease course, when treatment is more difficult, and prognosis grim. 

Treatment for cancer typically includes surgery, chemotherapy, and/or radiation 

25 therapy. Although nearly 50 percent of cancer patients can be effectively treated using 

these methods, the current therapies all induce serious side effects which diminish quality of 
life. The identification of novel therapeutic targets and diagnostic markers will be 
important for improving the diagnosis, prognosis, and treatment of cancer patients. 

Recent advances in molecular medicine have increased the interest in tumor-specific 

30 antigens that could serve as targets for various immunotherapeutic or small molecule 

strategies. Antigens suitable for immunotherapeutic strategies should be highly expressed 
in cancer tissues, preferably accessible from the vasculature and at the cell surface, and 
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ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated, e.g., reproductive organs, especially those absent in one 
sex. Examples of antigens that are currently available for the detection and treatment of 
certain cancers include Her2/neu and the B-cell antigen CD20. Humanized monclonal 
5 antibodies directed to Her2/neu (Herceptin®/trastuzumab) are currently in use for the 

treatment of metastatic breast cancer. See Ross and Fletcher (1998) Stem Cells 16:413-428. 
Similarly, anti-CD20 monoclonal antibodies (Rituxin®/rituximab) are used to effectively 
treat non-Hodgkin's lymphoma. See Maloney, et al. (1 997) Blood 90:21 88-2195; Leget and 
Czuczman (1998) Curr. Opin. Oncol. 10:548-551. 

1 0 The elucidation of a role for novel proteins and compounds in disease states for 

identification of therapeutic targets and diagnostic markers is valuable for improving the 
current treatment of cancer patients. Accordingly, provided herein are molecular targets for 
therapeutic intervention in various defined cancers. Additionally, provided herein are 
methods that can be used in diagnosis and prognosis of cancer. Further provided are 

1 5 methods that can be used to screen candidate bioactive agents for the ability to modulate 
cancer. 

SUMMARY OF THE INVENTION 

The present invention provides methods for detecting a pathological cell in a patient, 
the method comprising detecting a nucleic acid or polypeptide comprising a sequence at 

20 least 80% identical to a sequence described in Table 2 or the attached listing of SEQ ID 

NOs:l-l 16 in a biological sample from the patient, thereby detecting, either qualitatively or 
quantitatively, the pathological cell. In certain embodiments of the method, the pathological 
cell has a pathology (i.e. disease state, abnormality, or medical condition) selected from 
those listed in Table 1, including cancer. In some embodiments of the method, the 

25 biological sample comprises nucleic acids (e.g. mRNA); the biological sample is tissue 
from an organ which is affected by a pathology listed in Table 1, including a cancer; a 
further step is used of amplifying nucleic acids before the step of detecting the nucleic acid; 
the detecting is of a protein encoded by the nucleic acid; the nucleic acid comprises a 
sequence as described in Table 2 or the attached listing of SEQ ID NOs: 1-1 1 6 ; the 

30 detecting step is carried out by using a labeled nucleic acid probe, utilizing a biochip 

comprising a sequence at least 80% identical to a sequence as described in Table 2 or the 
attached listing of SEQ ID NOs:l-l 16 , or detecting a polypeptide encoded by a nucleic 
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acid; or the patient is undergoing a therapeutic regimen to treat a pathology of Table 1, or is 
suspected of having a pathology (e.g. cancer). 

Compositions are also provided, e.g., an isolated nucleic acid molecule comprising a 
sequence as described in Table 2 or SEQ ID NOs: 1-58, including, e.g., those which are 
5 labeled; an expression vector comprising such nucleic acid; a host cell comprising such 

expression vector; an isolated polypeptide which is encoded by such a nucleic acid molecule 
comprising a sequence as described in Table 2 or SEQ ID NOs:59-l 16; or an antibody that 
specifically binds a polypeptide comprising a sequence selected from those listed in SEQ ED 
NOs:59-l 16. In particular embodiments, the antibody is conjugated to an effector 

10 component, is conjugated to a detectable label (including, e.g., a fluorescent label, a 

radioisotope, or a cytotoxic chemical), an antibody fragment, or is a humanized antibody. 

Additional methods are provided, including methods for specifically targeting a 
compound to a pathological cell in a patient, the method comprising administering to the 
patient an antibody conjugated to, or capable of binding to, the compound, as described, 

1 5 thereby providing the targetting. Others include, e.g., methods for determining the presence 
or absence of a pathological cell in a patient, the methods comprising contacting a 
biological sample with an antibody, as described. In more particular methods, the antibody 
is: conjugated to an effector component, or to a fluorescent label; or the biological sample is 
a blood, serum, urine, or stool sample. 

20 Further methods include those for identifying, or screening, compounds that 

modulate the function of pathology-associated polypeptides (e.g. polypeptides that have 
been identified associated with a disease state via gene expression analysis), the method 
comprising: contacting the compound with a pathology-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

25 80% identical to a sequence as described in Table 2 or the attached listing of SEQ ID 
NOs:l-l 16 ; and determining the effect of the compound upon the function of the 
polypeptide. Another drug screening assay method comprises steps of: administering a test 
compound to a mammal having a pathology of Table 1 or a cell isolated therefrom; and 
comparing the level of gene expression of a polynucleotide that selectively hybridizes to a 

30 sequence at least 80% identical to a sequence as described in Table 2 or the attached listing 
of SEQ ID NOs: 1-116 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell or mammal, wherein a test compound that modulates the 
level of expression of the polynucleotide is a candidate for the treatment of the pathology. 
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DETAILED DESCRD?TION OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for various disorders, e.g., angiogenesis, 
fibrosis, and various defined forms of cancer, including metastatic cancer, as well as 
methods for screening for compositions which modulate such conditions. Also provided are 
methods for treating such disorders or cancers. See, e.g., American Society of Clinical 
Oncology (ed. 2001) ASCO Curriculum: Svmntoni Manag e* Kendall/Hunt, ISBN: 
0787277851; Bonadonna, et al. (2001) Textbook of Breast fWer ( 2 d ed.) Dunitz Martin, 
ISBN: 1853178241; Devita and Hellman (eds. 2001) Cancer Principles and Practice of 
Oncology (2 vols.), Lippincott Williams, ISBN: 0781723876; Howell, et al. (2001) Breast 
Cancer Isis Medical Media, ISBN: 1901865584; Kaye and Laws (2001) Brain Tumours: An 
Encyclopedic Ap proach (2d ed.) Churchill Livingstone, ISBN: 0443064261; Mihm, et al. 
(2001) The Melanocyte Proliferation: A Comprehensive Tevft ook of Pigments T Pg ,w 
Wiley-Liss, ISBN: 0471252719; Montgomery and Aaron (2001) Clinical Path^og y ofSoft. 
Tissue Tumors Marcel Dekker, ISBN: 0824702905; Petrovich, et al. (eds. 2001) Combined 
Modality of Central Nervous System Tumors (Medical Radiology) Springer Verlag, ISBN: 
3540660534; Rosen (2001) Rosen's Breast Pathology Lippincott Williams and Wilkins, 
ISBN: 0781723795; Shah, et al. (2001) Oral Cancer Isis Medical Media, ISBN: 
189906687X; Weiss and Goldblum (2001) Enzinger and Weil 's Soft Tissue Tn^ (4t h 
ed.) Mosby, ISBN: 0323012000; Abeloff, et al. (eds. 2000) Clinical Oncology (2d ed.) 
Churchill Livingstone, ISBN: 044307545X; American Society of Clinical Oncology (ed. 
200 °) Cancer Genetics and Cancer P r edisposition Testing Kendall/Hunt, ISBN: 
0787276154; Fletcher (2000) Diagnostic Histnp athology of Tnmor* ( 2 vols. 2d ed.) 
Churchill Livingstone, ISBN: 0443079927; Vogelzang (ed. 2000) Comprehensive TevthooV 
of Genitourinary Oncology (2d ed.) Lippincott Williams and Wilkins, ISBN: 0683306456; 
Holland, et al. (eds. 2000) Holland-Frei Cancer MeHi™» (B ook with C D-ROM 5th ed.) ' 
Decker, ISBN: 1550091131; Turrisi, etal. (2000) LjingCancer Isis Medical Media, ISBN: 
1901865428; Bartolozzi and Lencioni (eds. 1999) Liver Malignance s: Diagnostic and 
Interventional Radiology (Medical Radiology) Springer Verlag, ISBN: 3540647562; 
Gasparini (ed. 1999) Pro g n ostic Variables in Node-Nep atiye and Node-Posits p J,,, 
Cancer Kluwer, ISBN: 0792384474; Hansen (ed. 1999) The LAST. C. TevrhooV oft „„ r 
Cancer: International Association for the Study of T,,n r r™^ Martin> ISBN . 
1853177083; Raghavan, et al. (eds. 1999) Textbook of Uncommon r* (2nd ed>) wileV) 
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ISBN: 0471929212; Thawley, et al. (eds. 1999) Comprehensive Management of Head and 
Neck Tumors (2 vols.) Saunders, ISBN: 0721655823; Whittaker and Holmes (eds. 1999) 
Leukemia and Related Disorders (3d ed.) Blackwell Science, ISBN: 0865426074; Aapro 
(ed. 1998) OncoMedia: Medical Oncolopy (CD-ROM) Elsevier Science, ISBN: 
5 0080427480; Abeloff (1998) Clinical Oncology (Library Version 2 CD-ROM Individual 
Version 2.0 Windows and Macintosh) Harcourt Brace, ISBN: 0443075557; Benson (ed. 
1998 ) Gastrointestinal Oncology (Cancer Treatment and Research, CTAR 98) Kluwer, 
ISBN: 0792382056; Brambilla and Brambilla (eds. 1998) Lung Tumors: Fundamental 
Biology and Clinical Management (Vol 124) Marcel Dekker, ISBN: 0824701607; Canellos, 

10 et al. (eds. 1998) The Lymphomas Saunders, ISBN: 0721650309; Greenspan and Remagen 
( 1998 ) Differential Diagnosis of T umors and Tumor-Like Lesions of Bones anH Tmnts 
Lippincott Williams and Wilkins Publishers, ISBN: 0397517106; Hiddemann (ed. 1998) 
Acute Leukemias VII: Experimental App ro aches and Novel Therap ies (Haematologie Und 
Bluttransfusion, Vol 39), Springer Verlag, ISBN: 3540635041; Husband and Reznek (1998) 

1 5 Imaging in Oncology (2 vols.) Mosby, ISBN: 1 899066489; Leibel and Phillips (eds. 1 998) 
Textbook of Radiation Oncology Saunders, ISBN: 0721653367; Maloney and Miller (eds. 
1 998 ) Cutaneous Oncology: Pathophysiology. Diagnosis, and Manap ement Blackwell 
Science, ISBN: 0865425175; Mittal, et al. (eds. 1998) Advances in Radiation Therapy 
Kluwe, ISBN: 079239981 1 ; Oldham (ed. 1998) Principles of Cancer Biotherapv (3d ed.) 

20 Kluwer, ISBN: 0792335074; Ozols (ed. 1998) Gynecologic Oncolo p y Kluwer, ISBN: 

0792380703; Parkin, et al. (eds. 1998) Cancer Inciden ce in Five Continents (I arc Scientific 
Publications, No 143) Oxford University Press, ISBN: 9283221435; Perez and Brady (eds. 
1998 ) Principles and Practice of Radiation Oncolopy Lippincott Williams and Wilkins, 
ISBN: 0397584164; Black, etal. (eds. 1997) Cancer of the Nervous Sy stem rwwii 

25 Science, ISBN: 0865423849; Bonadonna, et al. (1 997) Textbook of Breast Cancer: A 
Clinical Guide to Therapy Blackwell Science, ISBN: 1853173487; Pollock (ed. 1997) 
Surgical Oncology Kluwer, ISBN: 0792399005; Sheaves, et al. (eds. 1997) Clinical 
Endocrine Oncolopy Blackwell Science, ISBN: 086542862X; Vahrson (1997) Radiation 
Oncology of Gynecological Cancers Springer Verlag, ISBN: 0387567682; Walterhouse and 

30 Cohn (eds. 1 997) Diagnostic and Therape utic Advances in Pediatric Onr.n1r.py Kluwer, 
ISBN: 0792399781; Aisner (ed. 1996) Comprehensive Textbook of Thoracic OncoW 
Lippincott, Williams and Wilkins, ISBN: 0683000624; Bertino, et al. (eds. 1996) 
Encyclopedia of Cancer (3 vols.) Academic, ISBN: 012093230X; Cavalli, et al. (1996) 
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Textbook of Medical Oncology Dunitz Martin, ISBN: 1853172901; Peckham, etal. (eds. 
1995) Oxford Textbook of Oncology (2-Vols.) Oxford University Press, ISBN: 
0192616854; and Freireich and Kantarjian (eds. 1996) Molecular Penn ies and Therapy nf 
Leukemia (Cancer Treatment and Research, V. 84) Kluwer, ISBN: 0792339126. 

In particular, identification of markers selectively expressed on defined cancers 
allows for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, 
the invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and 
small molecule agonists/antagonists, which will be useful to selectively identify those 
markers. For example, therapeutic methods may take the form of protein therapeutics 
which use the marker expression for selective localization or modulation of function (for 
those markers which have a causative disease effect), for vaccines, identification of binding 
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for 
molecular characterization of subsets of the diseases, e.g., as provided in Table 1, which 
subsets may actually require very different treatments. Moreover, the markers may also be 
important in related diseases to the specific disorders and cancers, e.g., which affect similar 
tissues in non-malignant diseases, or have similar mechanisms of induction/maintenance. 
Metastatic processes or characteristics may also be targeted. Diagnostic and prognostic uses 
are made available, e.g., to subset related but distinct diseases, or to determine treatment 
strategy. The detection methods may be based upon nucleic acid, e.g., PCR or hybridization 
techniques, or protein, e.g., ELISA, imaging, IHC, etc. The diagnosis maybe qualitative or 
quantitative, and may detect increases or decreases in expression levels. 

Table 2 provides unigene cluster identification numbers for the nucleotide sequence 
of genes (SEQ ID NOs: 1-58) that exhibit increased or decreased expression in diseased 
samples, particularly sequences involved in angiogenesis, arthritis, prostate cancer, breast 
cancer, colorectal cancer, cervical cancer, bladder cancer, head and neck cancer, esophageal 
cancer, lung cancer, ovarian cancer, pancreatic cancer, renal cancer, stomach cancer, skin 
cancer, testicular cancer, uterine cancer, glioblastoma, Ewing sarcoma, soft tissue sarcoma, 
and lung fibrosis. Table 2 also provides an exemplar accession number thatprovides a 
nucleotide sequence that is part of the unigene cluster. 

30 Definitions 

The term "cancer protein" or "cancer polynucleotide" or "cancer-associated 
transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles, mutants, and 

-6- 
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interspecies homologues that: (1) have a nucleotide sequence that has greater than about 
60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably about 92%, 
94%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity, preferably over a 
region of over a region of at least about 25, 50, 100, 200, 500, 1000, or more nucleotides, to 
5 a nucleotide sequence of or associated with a gene of Table 2 or SEQ ED NOs: 1-58; (2) bind 
to antibodies, e.g., polyclonal antibodies, raised against an immunogen comprising an 
amino acid sequence encoded by a nucleotide sequence of or associated with a gene of 
Table 2 or SEQ ID NOs: 1-58, and conservatively modified variants thereof; (3) specifically 
hybridize under stringent hybridization conditions to a nucleic acid sequence, or the 
10 complement thereof of Table 2 or SEQ ID NOs:l-58 and conservatively modified variants 
thereof; or (4) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, preferably 90%, 91%, 93%, 95%, 97%, 
98%, or 99% or greater amino sequence identity, preferably over a region of over a region 
of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an amino acid 
1 5 sequence encoded by a nucleotide sequence of or associated with a gene of Table 2 or SEQ 
ID NOs: 1-58. A polynucleotide or polypeptide sequence is typically from a mammal 
including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, 
pig, horse, sheep, or other mammal. A "cancer polypeptide" and a "cancer polynucleotide," 
include both naturally occurring or recombinant forms. 
20 A "full length" cancer protein or nucleic acid refers to a cancer polypeptide or 

polynucleotide sequence, or a variant thereof, that contains elements normally contained in 
one or more naturally occurring, wild type cancer polynucleotide or polypeptide sequences. 
The "full length" may be prior to, or after, various stages of post-translational processing or 
splicing, including alternative splicing. 
25 "Biological sample" as used herein is a sample of biological tissue or fluid that 

contains nucleic acids or polypeptides, e.g., of a cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
30 archival samples, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures 
derived from patient tissues. A biological sample is typically obtained from a eukaryotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
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dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a bird; reptile; or fish. Livestock 
and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
5 cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing 
the methods of the invention in vivo. Archival tissues or materials, having treatment or 
outcome history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic 

10 acids or polypeptide sequences, refer to two or more sequences or subsequences that are the 
same or have a specified percentage of amino acid residues or nucleotides that are the same 
(e.g., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 93%, 95%, 97%, 98%, 
99%, or higher identity over a specified region, when compared and aligned for maximum 
correspondence over a comparison window or designated region) as measured using, e.g., a 

1 5 BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described 
below, or by manual alignment and visual inspection (see, e.g., the NCBI web site, or the 
like). Such sequences are then said to be "substantially identical." This definition also 
refers to, or may be applied to, the complement of a test sequence. The definition also 
includes sequences that have deletions and/or insertions, substitutions, and naturally 

20 occurring, e.g., polymorphic or allelic variants, and man-made variants. As described 

below, the preferred algorithms can account for gaps and the like. Preferably, identity exists 
over a region that is at least about 25 amino acids or nucleotides in length, or more 
preferably over a region that is about 50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, to 

25 which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 

30 relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
contiguous positions selected from the group consisting typically of from about 20 to 600, 
usually about 50 to 200, more usually about 100 to 1 50, in which a sequence may be 
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compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 
well-known. Optimal alignment of sequences for comparison can be conducted, e.g., by the 
local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482-489, by 
5 the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443- 
453, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl Acad. 
Sci. USA 85:2444-2448, by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual 

10 inspection (see, e.g., Ausubel, et al. (eds. 1995 and supplements) Current Protocols in 
Molecular Biology Wiley). 

Preferred examples of algorithms that are suitable for determining percent sequence 
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 
described in Altschul, et al. (1977) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. 

15 (1990) J. Mol. Biol 215:403-410. BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins 
of the invention. Software for performing BLAST analyses is publicly available through 
the web-site for National Center for Biotechnology Information (NCBI). This algorithm 
involves first identifying high scoring sequence pairs (HSPs) by identifying short words of 

20 length W in the query sequence, which either match or satisfy some positive-valued 

threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 

25 cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 

30 quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
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uses as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915-919) alignments 
5 (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity between 
two sequences. See, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873- 
5787. One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match 

1 0 between two nucleotide or amino acid sequences would occur by chance. For example, a 
nucleic acid is considered similar to a reference sequence if the smallest sum probability in 
a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, 
more preferably less than about 0.01, and most preferably less than about 0.001. Log values 
may be negative large numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc. 

1 5 An indication that two nucleic acid sequences are substantially identical is that the 

polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
peptides differ only by conservative substitutions. Another indication that two nucleic acid 

20 sequences are substantially identical is that the two molecules or their complements 

hybridize to each other under stringent conditions. Yet another indication that two nucleic 
acid sequences are substantially identical is that the same primers can be used to amplify the 
sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
25 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection (ATCC) catalog or web site). 
30 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
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chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than 
protein encoded by the gene. The term "purified" in some embodiments denotes that a 

5 nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. 
Preferably, it means that the nucleic acid or protein is at least about 85% pure, more 
preferably at least 95% pure, and most preferably at least 99% pure. "Purify" or 
"purification" in other embodiments means removing at least one contaminant or 
component from the composition to be purified. In this sense, purification does not require 

10 that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in 
which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 

15 containing modified residues, and non-naturally occurring amino acid polymers. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, as 
well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 

20 carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that 
have the same basic chemical structure as a naturally occurring amino acid, e.g., an a 
carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., 
homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs 
may have modified R groups (e.g., norleucine) or modified peptide backbones, but retain 

25 somebasic chemical structure as a naturally occurring amino acid. Amino acid mimetic 
refers to a chemical compound that has a structure that is different from the general 
chemical structure of an amino acid, but that functions similarly to another amino acid. 

Amino acids may be referred to herein by either their commonly known three letter 
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 

30 Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variant" applies to both amino acid and nucleic acid 
sequences. With respect to particular nucleic acid sequences, conservatively modified 
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variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids 
5 encode most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode 
the amino acid alanine. Thus, at each position where an alanine is specified by a codon, the 
codon can be altered to another of the corresponding codons described without altering the 
encoded polypeptide. Such nucleic acid variations are "silent variations," which are one 
species of conservatively modified variations. Every nucleic acid sequence herein which 

10 encodes a polypeptide also describes silent variations of the nucleic acid. In certain 

contexts each codon in a nucleic acid (except AUG, which is ordinarily the only codon for 
methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified 
to yield a functionally similar molecule. Accordingly, a silent variation of a nucleic acid 
which encodes a polypeptide is implicit in a described sequence with respect to the 

1 5 expression product, but not necessarily with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual substitutions, 
deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 
alters, adds, or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration results in the 

20 substitution of an amino acid with a chemically similar amino acid. Conservative 

substitution tables providing functionally similar amino acids are well known. Such 

conservatively modified variants are in addition to and do not exclude polymorphic variants, 
*• 

interspecies homologs, and alleles of the invention. Typically conservative substitutions 
include for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 

25 (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (£), 

Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 
(W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton 
(1984) Proteins: Structure and Molecular Properties Freeman). 

Macromolecular structures such as polypeptide structures can be described in terms 

30 of various levels of organization. For a general discussion of this organization, see, e.g., 
Alberts, et al. (eds. 2001) Molecular Biology of the Cell (4th ed.) Garland; and Cantor and 
Schimmel (1980) Biophysical Chemistry Part I: The Conformation of Biological 
Macromolecules Freeman. "Primary structure" refers to the amino acid sequence of a 
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particular peptide. "Secondary structure" refers to locally ordered, three dimensional 
structures within a polypeptide. These structures are commonly known as domains. 
Domains are portions of a polypeptide that often form a compact unit of the polypeptide and 
are typically 25 to approximately 500 amino acids long. Typical domains are made up of 
5 sections of lesser organization such as stretches of (3-sheet and a-helices. "Tertiary 
structure" refers to the complete three dimensional structure of a polypeptide monomer. 
"Quaternary structure" refers to the three dimensional structure formed, usually by the 
noncovalent association of independent tertiary units. Anisotropic terms are also known as 
energy terms. 

10 "Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 

used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50, or more nucleotides in length, 
up to about 100 nucleotides in length. Nucleic acids and polynucleotides are a polymers of 
any length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 

1 5 10,000, etc. A nucleic acid of the present invention will generally contain phosphodiester 
bonds, although in some cases, nucleic acid analogs are included that may have at least one 
different linkahge, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A 
Practical Approach Oxford Univ. Press); and peptide nucleic acid backbones and linkages. 

20 Other analog nucleic acids include those with positive backbones; non-ionic backbones, and 
non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 
5,034,506, and Chapters 6 and 7 of Sanghvi and Cook (eds. 1994) Carbohydrate 
Modifications in Antisense Research ACS Symposium Series 580. Nucleic acids 
containing one or more carbocyclic sugars are also included within one definition of nucleic 

25 acids. Modifications of the ribose-phosphate backbone may be done for a variety of 
reasons, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures 
of naturally occurring nucleic acids and analogs may be made. 

30 A variety of references disclose such nucleic acid analogs, including, e.g., 

phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-1963 and references 
therein; Letsinger (1970) J. Org. Chem. 35:3800-3803; Sprinzl, et al. (1977) Eur. J. 
Biochem. 81:579-589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. 
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(1984) Chem. Lett. 805, Letsinger, et al. (1988) J. Am. Chem. Soc. 1 10:4470-4471; and 
Pauwels, et al. (1986) Chemica Scripta 26:141-149), phosphorothioate (Mag, et al. (1991) 
Nucleic Acids Res. 19:1437-441; and U.S. Patent No. 5,644,048), phosphorodithioate (Brill, 
et ai. (1989) J. Am. Chem. Soc. 1 1 1 :2321-2322), O-methylphophoroamidite linkages (see 
5 Eckstein (1 992) Oligonucleotides and Analogues: A Practical Approach. Oxford Univ. 
Press), and peptide nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. 
Soc, 114:1895-1897; Meier, et al. (1992) Chem. Int. Ed. Engl. 31:1008-1010: Nielsen 
(1993) Nature 365:566-568; Carlsson, et al. (1996) Nature 380:207, all of which are 
incorporated by reference). Other analog nucleic acids include those with positive 

10 backbones (Denpcy, et al. (1995) Proc. Natl. Acad. Sci. USA 92:6097-101; non-ionic 
backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141, and 4,469,863; 
Kiedrowski, et al. (1991) Angew. Chem. Intl. Ed. English 30:423-426; Letsinger, et al. 
(1988) J. Am. Chem. Soc. 1 10:4470-4471; Letsinger, et al. (1994) Nucleoside and 
Nucleotide 13:1597; Chapters 2 and 3 in Sanghvi and Cook (eds. 1994) Carbohydrate 

1 5 Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et al. (1 994) 
Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular NMR 
34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743) and non-ribose backbones, including 
those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7 in 
Sanghvi and Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS 

20 Symposium Series 580. Nucleic acids containing one or more carbocyclic sugars are also 
included within one definition of nucleic acids (see Jenkins, et al. (1995) Chem. Soc. Rev. 
pp 169-176). Several nucleic acid analogs are described in Rawls (page 35, June 2, 1997) 
C&E News . 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
25 nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, 
in contrast to the highly charged phosphodiester backbone of naturally occurring nucleic 
acids. This results in at least two advantages. The PNA backbone exhibits improved 
hybridization kinetics. PNAs have larger changes in the melting temperature (T^ for 
mismatched versus perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C 
30 drop in T m for an internal mismatch. With the non-ionic PNA backbone, the drop is closer 
to 7-9° C. Similarly, due to their non-ionic nature, hybridization of the bases attached to 
these backbones is relatively insensitive to salt concentration. In addition, PNAs are not 

degraded by cellular enzymes, and thus can be more stable. 
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The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. The depiction of a single 
strand also defines the sequence of the complementary strand; thus the sequences described 
herein also provide the complement of the sequence. The nucleic acid may be DNA, both 
5 genomic and cDNA, RNA, or a hybrid, where the nucleic acid may contain combinations of 
deoxyribo- and ribo-nucleotides, and combinations of bases, including uracil, adenine, 
thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine, isoguanine, etc. 
"Transcript" typically refers to a naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or 
mRNA. As used herein, the term "nucleoside" includes nucleotides and nucleoside and 
10 nucleotide analogs, and modified nucleosides such as amino modified nucleosides. In 
addition, "nucleoside" includes non-naturally occurring analog structures. Thus, e.g., the 
individual units of a peptide nucleic acid, each containing a base, aTe referred to herein as a 
nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
1 5 photochemical, biochemical, immunochemical, physiological, chemical, or other physical 
means. In general, labels fall into three classes: a) isotopic labels, which may be radioactive 
or heavy isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags; 
and c) colored or fluorescent dyes. The labels may be incorporated into the cancer nucleic 
acids, proteins, and antibodies. For example, the label should be capable of producing, 
20 either directly or indirectly, a detectable signal. The detectable moiety may be a 

radioisotope, such as 3 H, 14 C, 32 P, 35 S, or 125 I, electron-dense reagents, a fluorescent or 
chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or 
an enzyme (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and 
proteins or other entities which can be made detectable such as alkaline phosphatase, beta- 

25 galactosidase, or horseradish peroxidase. Methods are known for conjugating the antibody 
to the label. See, e.g., Hunter, et al. (1962) Nature 144:945; David, et al. (1974) 
Biochemistry 13:1014-1021; Pain, etal. (198n J. Immunol. Meth. 40:219-230: andNygren 
(1982) J. Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is 

30 bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an 
antibody. The "effector" can be a variety of molecules including, e.g., detection moieties 
including radioactive compounds, fluorescent compounds, enzymes or substrates, tags such 
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as epitope tags, toxins; activatable moieties, chemotherapeutic agents; lipases; antibiotics; 
chemoattracting moieties, immune modulators (micA/B), or radioisotopes, e.g., emitting 
"hard" beta, radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, e.g., 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may 
be detected by detecting the presence of the label bound to the probe. Alternatively, 
methods using high affinity interactions may achieve the same results where one of a pair of 
binding partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 
binding to a target nucleic acid of complementary sequence through one or more types of 
chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 
formation. As used herein, a probe may include natural (e.g., A, G, C, or T) or modified 
bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 
linkage other than a phosphodiester bond, preferably one that does not functionally interfere 
with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent 
bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind 
target sequences lacking complete complementarity with the probe sequence depending 
upon the stringency of the hybridization conditions. The probes are preferably directly 
labeled, e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, 
e.g., with biotin to which a streptavidin complex may later bind. By assaying for the 
presence or absence of the probe, one can detect the presence or absence of the select 
sequence or subsequence. Diagnosis or prognosis may be based at the genomic level, or at 
the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein, or vector, has been modified 
by the introduction of a heterologous nucleic acid or protein or the alteration of a native 
nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
recombinant cells express genes that are not found within the native (non-recombinant) 
form of the cell or express native genes that are otherwise abnormally expressed, under 
expressed, or not expressed at all. By the term "recombinant nucleic acid" herein is meant 
nucleic acid, originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., 
using polymerases and endonucleases, in a form not normally found in nature. In this 

-16- 



WO 2004/073657 



PCT/US2004/005455 



manner, operably linkage of different sequences is achieved. Thus an isolated nucleic acid, 
in a linear form, or an expression vector formed in vitro by ligating DNA molecules that are 
not normally joined, are both considered recombinant for the purposes of this invention. It 
is understood that once a recombinant nucleic acid is made and reintroduced into a host cell 
5 or organism, it will replicate non-recombinantly, e.g., using the in vivo cellular machinery 
of the host cell rather than in vitro manipulations; however, such nucleic acids, once 
produced recombinantly, although subsequently replicated non-recombinantly, are still 
considered recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, 

1 0 e.g., through the expression of a recombinant nucleic acid as depicted above. A 

recombinant protein is distinguished from naturally occurring protein by at least one or 
more characteristics. The protein may be isolated or purified away from some or most of 
the proteins and compounds with which it is normally associated in its wild type host, and 
thus may be substantially pure. An isolated protein is unaccompanied by at least some of 

1 5 the material with which it is normally associated in its natural state, preferably constituting 
at least about 0.5%, more preferably at least about 5% by weight of the total protein in a 
given sample. A substantially pure protein comprises at least about 75% by weight of the 
total protein, with at least about 80% being preferred, and at least about 90% being 
particularly preferred. The definition includes the production of a cancer protein from one 

20 organism in a different organism or host cell. Alternatively, the protein may be made at a 
significantly higher concentration than is normally seen, through the use of an inducible 
promoter or high expression promoter, such that the protein is made at increased 
concentration levels. Alternatively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 

25 deletions, as discussed below. 

The term "heterologous" when used with reference to portions of a nucleic acid 
indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated 

30 genes arranged to make a new functional nucleic acid, e.g., a promoter from one source and 
a coding region from another source. Similarly, a heterologous protein will often refer to 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 
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A "promoter 0 is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or 
5 repressor elements, which can be located as much as several thousand base pairs from the 
start site of transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is active under 
environmental or developmental regulation. The term "operably linked" refers to a 
functional linkage between a nucleic acid expression control sequence (such as a promoter, 

10 or array of transcription factor binding sites) and a second nucleic acid sequence, e.g., 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a 

1 5 particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, 
or nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence 

20 under stringent hybridization conditions when that sequence is present in a complex mixture 
(e.g., total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under which a 
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be 

25 different in different circumstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in 
"Overview of principles of hybridization and the strategy of nucleic acid assays" in Tijssen 
(1993) Hybridizati on with Nucleic Probes (Laboratory Techniques in Biochemistry and 
Molecular Biology! (vol. 24) Elsevier. Generally, stringent conditions are selected to be 

30 about 5-1 0° C lower than the thermal melting point (T m ) for the specific sequence at a 
defined ionic strength pH. The T m is the temperature (under defined ionic strength, pH, 
and nucleic concentration) at which 50% of the probes complementary to the target 

hybridize to the target sequence at equilibrium (as the target sequences are present in 
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excess, at T m , 50% of the probes are occupied at equilibrium). Stringent conditions will be 
those in which the salt concentration is less than about 1 .0 M sodium ion, typically about 
0.01-1 .0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is 
at least about 30° C for short probes (e.g., about 10-50 nucleotides) and at least about 60° C 
5 for long probes (e.g., greater than about 50 nucleotides). Stringent conditions may also be 
achieved with the addition of destabilizing agents such as formamide. For selective or 
specific hybridization, a positive signal is typically at least two times background, 
preferably 10 times background hybridization. Exemplary stringent hybridization 
conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, incubating at 42° 

10 C, or, 5x SSC, 1 % SDS, incubating at 65° C, with wash in 0.2x SSC, and 0. 1% SDS at 65° 
C. For PCR, a temperature of about 36° C is typical for low stringency amplification, 
although annealing temperatures may vary between about 32-48° C depending on primer 
length. For high stringency PCR amplification, a temperature of about 62° C is typical, 
although high stringency annealing temperatures can range from about 50-65° C, depending 

15 on the primer length and specificity. Typical cycle conditions for both high and low 
stringency amplifications include a denaturation phase of 90-95° C for 30-120 sec, an 
annealing phase lasting 30-120 sec, and an extension phase of about 72° C for 1-2 min. 
Protocols and guidelines for low and high stringency amplification reactions are provided, 
e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and Applications Academic 

20 Press, NY. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically 
25 hybridize under moderately stringent hybridization conditions. Exemplary "moderately 
stringent hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 
M NaCl, 1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is 
typically at least twice background. Alternative hybridization and wash conditions can be 
utilized to provide conditions of similar stringency. Additional guidelines for determining 
30 hybridization parameters are provided in numerous references, e.g., Ausubel, et al. (eds. 
1991 and supplements) Current Protocols in Molecular Biology Wiley. 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a cancer protein includes the determination of a parameter that is 
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indirectly or directly under the influence of the cancer protein or nucleic acid, e.g., a 
physiological, functional, physical, or chemical effect, such as the ability to decrease cancer. 
It includes ligand binding activity; cell viability; cell growth on soft agar; anchorage 
dependence; contact inhibition and density limitation of growth; cellular proliferation; 
5 cellular transformation; growth factor or serum dependence; tumor specific marker levels; 
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 
expression in cells undergoing metastasis; and other characteristics of cancer cells. 
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 

1 0 increases or decreases a parameter that is indirectly or directly under the influence of a 
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical 
effects. Such functional effects can be measured, e.g., changes in spectroscopic 
characteristics (e.g., fluorescence, absorbance, refractive index), hydrodynamic (e.g., 
shape), chromatographic, or solubility properties for the protein, measuring inducible 

1 5 markers or transcriptional activation of the cancer protein, measuring binding activity or 
binding assays, e.g., binding to antibodies or other ligands, and measuring growth, cellular 
proliferation, cell viability, cellular transformation, growth factor or serum dependence, 
tumor specific marker levels, invasiveness into Matrigel, tumor growth and metastasis in 
vivo, mRNA and protein expression, and other characteristics of cancer cells. The 

20 functional effects can be evaluated by many means, e.g., microscopy for quantitative or 
qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for cancer-associated sequences, measurement of RNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, j3-gal, GFP, and 
the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody 

25 binding, inducible markers, and ligand binding assays. 

"Inhibitors", "activators," and "modulators" of cancer polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 
compounds identified using in vitro and in vivo assays of cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally 

30 block activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate 
the activity or expression of cancer proteins, e.g., antagonists. Antisense or inhibitory 
nucleic acids may seem to inhibit expression and subsequent function of the protein. 
"Activators" are compounds that increase, open, activate, facilitate, enhance activation, 
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sensitize, agonize, or up regulate cancer protein activity. Inhibitors, activators, or 
modulators also include genetically modified versions of cancer proteins, e.g., versions with 
altered activity, as well as naturally occurring and synthetic ligands, antagonists, agonists, 
antibodies, small chemical molecules, and the like. Such assays for inhibitors and activators 
5 include, e.g., expressing the cancer protein in vitro, in cells, or cell membranes, applying 
putative modulator compounds, and then determining the functional effects on activity, as 
described above. Activators and inhibitors of cancer can also be identified by incubating 
cancer cells with the test compound and determining increases or decreases in the 
expression of 1 or more cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more 
10 cancer proteins, such as cancer proteins encoded by the sequences set out in Table 2 or SEQ 
IDNOs:59-116. 

Samples or assays comprising cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 
activator, or modulator to examine the extent of inhibition. Control samples (untreated with 

1 5 inhibitors) are assigned a relative protein activity value of 1 00%. Inhibition of a 
polypeptide is achieved when the activity value relative to the control is about 80%, 
preferably 50%, more preferably 25-0%. Activation of a cancer polypeptide is achieved 
when the activity value relative to the control (untreated with activators) is about 110%, 
more preferably 150%, more preferably 200-500% (e.g., two to five fold higher relative to 

20 the control), more preferably 1 000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as cell viability, formation of foci, 
anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and 
density limitation of growth, loss of growth factor or serum requirements, changes in cell 

25 morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., pp. 231-241 in Freshney (1994) Culture of Animal 
Cells a Manual of Basic Technique (2d ed.) Wiley-Liss. 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

30 "Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 

spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also 
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arise spontaneously or following exposure to a carcinogen, thereby mutating an endogenous 
gene. Transformation is associated with phenotypic changes, such as immortalization of 
cells, aberrant growth control, nonmorphological changes, and/or malignancy. See, 
Freshney (2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley- 
5 Liss. 

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an 
antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, 
delta, epsilon, and mu constant region genes, as well as the myriad immunoglobulin 

10 variable region genes. Light chains are classified as either kappa or lambda. Heavy chains 
are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the 
immunoglobulin classes, IgG, IgM, IgA, IgD, and IgE, respectively. Typically, the antigen- 
binding region of an antibody or its functional equivalent will be most critical in specificity 
and affinity of binding. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven. 

15 An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. 

Each tetramer is composed of two identical pairs of polypeptide chains, each pair having 
one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 
chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (VjJ and variable heavy 

20 chain (V jj) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a 
dimer of Fab which itself is a light chain joined to Vjj-ChI by a disulfide bond. The 

25 F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge 
region, thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab 1 monomer is 
essentially Fab with part of the hinge region (see Paul (ed. 1999) Fundamental Immunolo gy 
(4th ed.) Raven. While various antibody fragments are defined in terms of the digestion of 
an intact antibody, one of skill will appreciate that such fragments may be synthesized de 

30 novo either chemically or by using recombinant DNA methodology. Thus, the term 
antibody, as used herein, also includes antibody fragments either produced by the 
modification of whole antibodies, or those synthesized de novo using recombinant DNA 
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methodologies (e.g., single chain Fv) or those identified using phage display libraries (see, 
e.g., McCafferty, et al. (1990) Nature 348:552-554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many techniques known. See, e.g., Kohler and Milstein (1975) Nature 256:495- 
5 497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. (1985) pp. 77-96 in 

Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991) 
Current Protocols in Immunology Lippincott; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and 
Practice (2d ed.) Academic Press. Techniques for the production of single chain antibodies 

10 (U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this 

invention. Also, transgenic mice, or other organisms such as other mammals, may be used 
to express humanized antibodies. Alternatively, phage display technology can be used to 
identify antibodies and heteromeric Fab fragments that specifically bind to selected 
antigens. See, e.g., McCafferty, et al. (1990) Nature 348:552-554; Marks, et al. (1992) 

15 Biotechnology 10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, and/or species, or an 
entirely different molecule which confers new properties to the chimeric antibody, e.g., an 

20 enzyme, toxin, hormone, growth factor, drug, effector function, chemoattractant, immune 
modulator, etc.; or (b) the variable region, or a portion thereof, is altered, replaced, or 
exchanged with a variable region having a different or altered antigen specificity. 
Identification of cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient 

25 samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation 
of a number of genes simultaneously allows the generation of a gene expression profile that 
is characteristic of the state of the cell. That is, normal tissue may be distinguished from 

30 cancerous or metastatic cancerous tissue, or cancer tissue or metastatic cancerous tissue can 
be compared with tissue from surviving cancer patients. By comparing expression profiles 
of tissue in known different cancer states, information regarding which genes are important 
(including both up-and down-regulation of genes) in each of these states is obtained. 
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Molecular profiling may distinguish subtypes of a currently collective disease designation, 
e.g., different forms of a cancer. 

The identification of sequences that are differentially expressed in cancer versus 
non-cancer tissue allows the use of this information in a number of ways. For example, a 
5 particular treatment regime may be evaluated: does a chemotherapeutic drug act to down- 
regulate cancer, and thus tumor growth or recurrence, in a particular patient. Alternatively, 
a treatment step may induce other markers which may be used as targets to destroy tumor 
cells. Similarly, diagnosis and treatment outcomes may be done or confirmed by comparing 
patient samples with the known expression profiles. Maliganant disease may be compared 

10 to non-malignant conditions. Metastatic tissue can also be analyzed to determine the stage 
of cancer in the tissue, or origin of primary tumor, e.g., metastasis from a remote primary 
site. Furthermore, these gene expression profiles (or individual genes) allow screening of 
drug candidates with an eye to mimicking or altering a particular expression profile; e.g., 
screening can be done for drugs that suppress the cancer expression profile. This may be 

1 5 done by making biochips comprising sets of the important cancer genes, which can then be 
used in these screens. These methods can also be done on the protein basis; that is, protein 
expression levels of the cancer proteins can be evaluated for diagnostic puiposes or to 
screen candidate agents. In addition, the cancer nucleic acid sequences can be administered 
for gene therapy purposes, including the administration of antisense nucleic acids, or the 

20 cancer proteins (including antibodies and other modulators thereof) administered as 
therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that are 
differentially expressed in cancer relative to normal tissues and/or non-malignant disease, or 
in different types of related diseases, herein termed "cancer sequences." As outlined below, 

25 cancer sequences include those that are up-regulated (e.g., expressed at a higher level) in 
cancer, as well as those that are down-regulated (e.g., expressed at a lower level). In a 
preferred embodiment, the cancer sequences are from humans; however, cancer sequences 
from other organisms may be useful in animal models of disease and drug evaluation; thus, 
other cancer sequences are provided, from vertebrates, including mammals, including 

30 rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, 
goats, pigs, cows, horses, etc.) and pets (e.g., dogs, cats, etc.). Cancer sequences from other 
organisms may be obtained using the techniques outlined below. 
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Cancer sequences can include both nucleic acid and amino acid sequences. In a 
preferred embodiment, the skin cancer sequences are recombinant nucleic acids. These 
nucleic acid sequences are useful in a variety of applications, including diagnostic 
applications, which will detect naturally occurring nucleic acids, as well as screening 
5 applications; e.g., biochips comprising nucleic acid probes or PCR microtiter plates with 
selected probes to the cancer sequences. 

A cancer sequence can be initially identified by substantial nucleic acid and/or 
amino acid sequence homology to the cancer sequences outlined herein. Such homology 
can be based upon the overall nucleic acid or amino acid sequence, and is generally 
1 0 determined as outlined below, e.g., using homology programs or hybridization conditions. 

For identifying cancer-associated sequences, the cancer screen typically includes 
comparing genes identified in different tissues, e.g., normal and cancerous tissues, cancer 
and non-malignant conditions, non-malignant conditions and normal tissues, or tumor tissue 
samples from patients who have metastatic disease vs. non metastatic tissue. Other suitable 
1 5 tissue comparisons include comparing cancer samples with metastatic cancer samples from 
other cancers, such as lung, stomach, gastrointestinal cancers, etc. Samples of different 
stages of cancer, e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, 
are applied to biochips comprising nucleic acid probes. The samples are first 
microdissected, if applicable, and treated for preparation of mRNA. Suitable biochips are 
20 commercially available, e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles 
as described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal 
and disease states are compared to genes expressed in other normal tissues, including, and 
not limited to lung, heart, brain, liver, stomach, kidney, muscle, colon, small intestine, large 
25 intestine, spleen, bone, and/or placenta. In a preferred embodiment, those genes identified 
during the cancer screen that are expressed in a significant amount in other tissues (e.g., 
essential organs) are removed from the profile, although in some embodiments, this is not 
necessary (e.g., where organs may be dispensible, e.g., female or male specific). That is, 
when screening for drugs, it is usually preferable that the target expression be disease 
30 specific, to minimize possible side effects on other organs were there expression. 

In a preferred embodiment, cancer sequences are those that are up-regulated in 
cancer; that is, the expression of these genes is higher in the cancer tissue as compared to 
non-cancer or non-malignant tissue. "Up-regulation" as used herein often means at least 
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about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. Another embodiment is directed to sequences up- 
regulated in non-malignant conditions relative to normal. Uniformity among relevant 
samples is also preferred. 
5 Unigene cluster identification numbers and accession numbers herein are for the 

GenBank sequence database and the sequences of the accession numbers are hereby 
expressly incorporated by reference. GenBank is available, see, e.g., Benson, et al. (1998) 
Nuc. Acids Res. 26: 1 -7. Sequences are also available in other databases, e.g., European 
Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). In some 

10 situations, the sequences may be derived from assembly of available sequences or be 

predicted from genomic DNA using exon prediction algorithms, such as FGENESH. See 
Salamov and Solovyev (2000) Genome Res. 10:516-522. In other situations, sequences 
have been derived from cloning and sequencing of isolated nucleic acids. 

In another preferred embodiment, cancer sequences are those that are down- 

15 regulated in the cancer; that is, the expression of these genes is lower in cancer tissue as 
compared to non-cancerous tissue. "Down-regulation" as used herein often means at least 
about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. 
Informatics 

20 The ability to identify genes that are over or under expressed in cancer can 

additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 
biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with cancer or related diseases. See 

25 Tables 1-2. Or as another example, subcellular toxicological information can be generated 
to better direct drug structure and activity correlation (see Anderson (June 11-12, 1998) 
Pharmaceu tical Proteomics: Targets, Mechanism, and Function , paper presented at the IBC 
Proteomics conference, Coronado, CA). Subcellular toxicological information can also be 
utilized in a biological sensor device to predict the likely toxicological effect of chemical 

30 exposures and likely tolerable exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar 
advantages accrue from datasets relevant to other biomolecules and bioactive agents (e.g., 
nucleic acids, saccharides, lipids, drugs, and the like). 
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Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in a form in 
which data can be maintained and transmitted, but is preferably an electronic database. The 
5 electronic database of the invention can be maintained on any electronic device allowing for 
the storage of and access to the database, such as a personal computer, but is preferably 
distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is 
for clarity of illustration only. Similar databases can be assembled for assay data acquired 
1 0 using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing cancer, e.g., the identification of cancer-associated sequences described 
herein, provide an abundance of information which can be correlated with pathological 
1 5 conditions, predisposition to disease, drug testing, therapeutic monitoring, gene-disease 
causal linkages, identification of correlates of immunity and physiological status, among 
others. Although the data generated from the assays of the invention is suited for manual 
review and analysis, in a preferred embodiment, data processing using high-speed 
computers is utilized. 

20 An array of methods for indexing and retrieving biomolecular information is 

available. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 
Patent 5,953,727 discloses a relational database having sequence records containing 

25 information in a format that allows a collection of partial-length DNA sequences to be 

catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of 

30 similarity between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a 
method using mass spectroscopy fragmentation patterns of peptides to identify amino acid 
sequences in computer databases by comparison of predicted mass spectra with 
experimentally-derived mass spectra using a closeness-of-fit measure. U.S. Patent 
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5,926,818 discloses a multi-dimensional database comprising a functionality for multi- 
dimensional data analysis described as on-line analytical processing (OLAP), which entails 
the consolidation of projected and actual data according to more than one consolidation path 
or dimension. U.S. Patent 5,295,261 reports a hybrid database structure in which the fields 
of each database record are divided into two classes, navigational and informational data, 
with navigational fields stored in a hierarchical topological map which can be viewed as a 
tree structure or as the merger of two or more such tree structures. See also Baxevanis, et 
al - ( 2001 ) Bioinformati cs: A Practical Guuide to the Analysis of Genes and Proteins Wiley; 
Mount (2001) Bioinformati cs: Sequence and Genome Analysis CSH Press, NY; Durbin, et 
al. (eds. 1999) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic 
Acids Cambridge University Press; Baxevanis and Oeullette (eds. 1998) Bioinformatics: A 
Practical Guide to the Ana lysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and 
Buehler (1 999) Bioinformat ics: Basic Applications in Biological S cience and Medici™ 
CRC Press; Setubal, et al. (eds. 1997) Introduction to Computational Molecular Biolo^ 
Brooks/Cole; Misener and Krawetz (eds. 2000) Bioinformatics: Methods and Protocols 
Humana Press; Higgins and Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and 
Databanks: A Practical Approach Oxford University Press; Brown (2001) Bioinformatics: 
A Biologist's Guide to Biocomputing an d the Internet Eaton Pub.; Han and Kamber (2000) 
Data Mining: Concepts and Techniques Kaufinann Pub.; and Waterman (1995) Introduction 
to Computational Biolog y: Maps. Sequences, and Genomes Chap and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for cancer. In another variation, the assay records 
cross-tabulate one or more of the following parameters for each target species in a sample: 
(1) a unique identification code, which can include, e.g., a target molecular structure and/or 
characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample source; 
and (3) absolute and/or relative quantity of the target species present in the sample. 
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The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
5 and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states 
or transistor gate states, such as an array of cells in a DRAM device (e.g., each cell 
comprised of a transistor and a charge storage area, which may be on the transistor). In one 
embodiment, the invention provides such storage devices, and computer systems built 
10 therewith, comprising a bit pattern encoding a protein expression fingerprint record 

comprising unique identifiers for at least 10 target data records cross-tabulated with target 
source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 

1 5 computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. 
The comparison can include a sequence analysis or comparison algorithm or computer 
program embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the 
comparison may be of the relative amount of a peptide or nucleic acid sequence in a pool of 

20 sequences determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard 
(fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of the 

25 invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby 

30 at least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 
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The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing 
a query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the degree 
of identity and gap weight to the target data. A central processor is preferably initialized to 
load and execute the computer program for alignment and/or comparison of the assay 
results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a 
selected assay characteristic (e.g., binding to a selected affinity moiety) and the same 
characteristic of the query target and results are output via an I/O device. For example, a 
central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial 
or public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, 
or other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer, (3) a comparison target, such as a query target; and 
(4) a program for alignment and comparison, typically with rank-ordering of comparison 
results on the basis of computed similarity values. See, e.g., Ewens and Grant (2001) 
Statistical Methods in Bioinformatics: An I ntroduction Springer-Verlag. Mathematical 

-30- 



WO 2004/073657 



PCT/US2004/005455 



approaches can also be used to conclude whether similarities or differences in the gene 
expression exhibited by different samples are significant. See, e.g., Golub, et al. (1999) 
Science 286:53 1-537; Duda, et al. (2001) Pattern Classification Wiley; and Hastie, et al. 
( 2001 ) The Elements of Statistical Learning: Data Mining. Inference, and Prediction 
Springer-Verlag. One approach to determine whether a sample is more similar to or has 
maximum similarity with a given condition between the sample and one or more pools 
representing different conditions for comparison; the pool with the smallest vector angle is 
then chosen as the most similar to the biological sample among the pools compared. 
Characteristics of cancer-associated proteins 

Cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins, or intracellular proteins. In one embodiment, the cancer protein is 
an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the 
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication 
(including, e.g., signaling pathways); aberrant expression of such proteins often results in 
unregulated or disregulated cellular processes (see, e.g., Alberts, et al. (eds. 1994) 
Molecular Biology of the Cell (3d ed.) Garland). For example, many intracellular proteins 
have enzymatic activity such as protein kinase activity, protein phosphatase activity, 
protease activity, nucleotide cyclase activity, polymerase activity, and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of 
proteins, or targeting proteins to various subcellular localizations, and are involved in 
maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs for which defined functions have been attributed. 
In addition to the highly conserved sequences found in the enzymatic domain of proteins, 
highly conserved sequences have been identified in proteins that are involved in protein- 
protein interaction. For example, Src-homology-2 (SEE) domains bind tyrosine- 
phosphorylated targets in a sequence dependent manner. PTB domains, which are distinct 
from SH2 domains, also bind tyrosine phosphoiylated targets. SH3 domains bind to 
proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD domains to 
name only a few, have been shown to mediate protein-protein interactions. Some of these 
may also be involved in binding to phospholipids or other second messengers. These motifs 
can be identified on the basis of amino acid sequence; thus, an analysis of the sequence of 
proteins may provide insight into both the enzymatic potential of the molecule and/or 
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molecules with which the protein may associate. One useful database is Pfam (protein 
families), which is a large collection of multiple sequence alignments and hidden Markov 
models covering many common protein domains. Versions are available via the internet 
from Washington University in St. Louis, the Sanger Center in England, and the Karolinska 
5 Institute in Sweden. See, e.g., Bateman, et al. (2000) Nuc. Acids Res. 28:263-266; 

Sonnhammer, et al. (1997) Proteins 28:405-420 ; Bateman, et al. (1999) Nuc. Acids Res. 
27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-322. 

In another embodiment, the cancer sequences are transmembrane proteins. 
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 

10 have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 

1 5 kinases have both protein kinase activity and SH2 domains. In addition, 

autophosphorylation of tyrosines on the receptor molecule itself, creates binding sites for 
additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. 
For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl 

20 cyclases and receptor serine/threonine protein kinases contain a single transmembrane 

domain. However, various other proteins including channels and adenylyl cyclases contain 
numerous transmembrane domains. Many important cell surface receptors such as G 
protein coupled receptors (GPCRs) are classified as "seven transmembrane domain" 
proteins, as they contain 7 membrane spanning regions. Characteristics of transmembrane 

25 domains include approximately 17 consecutive hydrophobic amino acids that may be 

followed by charged amino acids. Therefore, upon analysis of the amino acid sequence of a 
particular protein, the localization and number of transmembrane domains within the 
protein may be predicted (see, e.g., PSORT web site http://psort.nibb.ac.jp/). Important 
transmembrane protein receptors include, but are not limited to the insulin receptor, insulin- 

30 like growth factor receptor, human growth hormone receptor, glucose transporters, 

transferrin receptor, epidermal growth factor receptor, low density lipoprotein receptor, 
epidermal growth factor receptor, leptin receptor, and interleukin receptors, e.g., IL-1 
receptor, IL-2 receptor, etc. 
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The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, 
5 extracellular domains are found on receptors. Factors that bind the receptor domain include 
circulating ligands, which may be peptides, proteins, or small molecules such as adenosine 
and the like. For example, growth factors such as EGF, FGF, and PDGF are circulating 
growth factors that bind to their cognate receptors to initiate a variety of cellular responses. 
Other factors include cytokines, mitogenic factors, neurotrophic factors, and the like. 

10 Extracellular domains also bind to cell-associated molecules. In this respect, they may 

mediate cell-cell interactions. Cell-associated ligands can be tethered to the cell, e.g., via a 
glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane proteins. 
Extracellular domains may also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

1 5 Cancer proteins that are transmembrane are particularly preferred in the present 

invention as they are readily accessible targets for immunotherapeutics, as are described 
herein. In addition, as outlined below, transmembrane proteins can be also useful in 
imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples 

20 are typically permeablized to provide access to intracellular proteins. In addition, some 
membrane proteins can be processed to release a soluble protein, or to expose a residual 
fragment. Released soluble proteins may be useful diagnostic markers, processed residual 
protein fragments may be useful lung markers of disease. 

It will also be appreciated that a transmembrane protein can be made soluble by 

25 removing transmembrane sequences, e.g., through recombinant methods. Furthermore, 
transmembrane proteins that have been made soluble can be made to be secreted through 
recombinant means by adding an appropriate signal sequence. 

In another embodiment, the cancer proteins are secreted proteins; the secretion of 
which can be either constitutive or regulated. These proteins may have a signal peptide or 

30 signal sequence that targets the molecule to the secretory pathway. Secreted proteins are 
involved in numerous physiological events; e.g., if circulating, they often serve to transmit 
signals to various other cell types. The secreted protein may function in an autocrine 
manner (acting on the cell that secreted the factor), a paracrine manner (acting on cells in 
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close proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a 
distance, e.g, secretion into the blood stream), or exocrine (secretion, e.g., through a duct or 
to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, lacrimal 
glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted molecules 

5 often find use in modulating or altering numerous aspects of physiology. Cancer proteins 
that are secreted proteins are particularly preferred in the present invention as they serve as 
good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. Those 
which are enzymes may be antibody or small molecule targets. Others may be useful as 
vaccine targets, e.g., via CTL mechanisms. 

1 0 Use of cancer nucleic acids 

As described above, cancer sequence is initially identified by substantial nucleic 
acid and/or amino acid sequence homology or linkage to the cancer sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 

15 hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

As detailed elsewhere, percent identity can be determined using an algorithm such as 
BLAST. A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the 
default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively. 
20 Alignment may include the introduction of gaps in the sequences to be aligned. In addition, 
for sequences which contain either more or fewer nucleotides than those of the nucleic acids 
described, the percentage of homology may be determined based on the number of 
homologous nucleosides in relation to the total number of nucleosides. Thus, e.g., 
homology of sequences shorter than those of the sequences identified will be determined 
25 using the number of nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through hybridization 
studies. Thus, e.g., nucleic acids which hybridize under high stringency to a described 
nucleic acid, or its complement, or is also found on naturally occurring mRNAs is 
considered a cancer sequence. In another embodiment, less stringent hybridization 
30 conditions are used; e.g., moderate or low stringency conditions may be used; see Ausubel, 
supra, and Tijssen, supra. 

The cancer nucleic acid sequences of the invention, e.g., the sequences in Table 3, 
can be fragments of larger genes, e.g., they are nucleic acid segments. "Genes 11 in this 
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context includes coding regions, non-coding regions, and mixtures of coding and non- 
coding regions. Accordingly, using the sequences provided herein, extended sequences, in 
either direction, of the cancer genes can be obtained, using techniques well known for 
cloning either longer sequences or the full length sequences; see Ausubel, et al., supra. 
Much can be done by informatics and many sequences can be clustered to include multiple 
sequences corresponding to a single gene, e.g., systems such as UniGene (see, UniGene 
database at the NCBI web-site). 

Once a cancer nucleic acid is identified, it can be cloned and, if necessary, its 
constituent parts recombined to form the entire cancer nucleic acid coding regions or the 
entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 
recombinant cancer nucleic acid can be further used as a probe to identify and isolate other 
cancer nucleic acids, e.g., extended coding regions. It can also be used as a "precursor" 
nucleic acid to make modified or variant cancer nucleic acids and proteins. 

The cancer nucleic acids of the present invention are used in several ways. In one 
embodiment, nucleic acid probes to the cancer nucleic acids are made and attached to 
biochips to be used in screening and diagnostic methods, as outlined below, or for 
administration, e.g., for gene therapy, vaccine, RNAi, and/or antisense applications. 
Alternatively, cancer nucleic acids that include coding regions of cancer proteins can be put 
into expression vectors for the expression of cancer proteins, again for screening purposes 
or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to cancer nucleic acids (both the 
nucleic acid sequences outlined in the figures and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the cancer nucleic acids, e.g., the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
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sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and 
partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes 
range from about 8-100 bases long, with from about 10-80 bases being preferred, and from 
about 30-50 bases being particularly preferred. That is, generally whole genes are not used. 
In some embodiments, much longer nucleic acids can be used, up to hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for 
a particular target The probes can be overlapping (e.g., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher 
sensitivity. 

Nucleic acids can be attached or immobilized to a solid support in a wide variety of 
ways. By "immobilized" and grammatical equivalents herein is meant the association or 
binding between the nucleic acid probe and the solid support is sufficient to be stable under 
the conditions of binding, washing, analysis, and removal as outlined. The binding can 
typically be covalent or non-covalent. By "non-covalent binding" and grammatical 
equivalents herein is meant one or more of electrostatic, hydrophilic, and hydrophobic 
interactions. Included in non-covalent binding is the covalent attachment of a molecule, 
e.g., streptavidin to the support and the non-covalent binding of the biotinylated probe to the 
streptavidin. By "covalent binding" and grammatical equivalents herein is meant that the 
two moieties, the solid support and the probe, are attached by at least one bond, including 
sigma bonds, pi bonds, and coordination bonds. Covalent bonds can be formed directly 
between the probe and the solid support or can be formed by a cross linker or by inclusion 
of a specific reactive group on either the solid support or the probe or both molecules. 
Immobilization may also involve a combination of covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways. As 
described herein, the nucleic acids can either be synthesized first, with subsequent 
attachment to the biochip, or can be directly synthesized on the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support- 
er other grammatical equivalents herein is meant a material that can be modified for the 
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attachment or association of the nucleic acid probes and is amenable to at least one 
detection method. Often, the substrate may contain discrete individual sites appropriate for 
individual partitioning and identification. The number of possible substrates is very large, 
and include, but are not limited to, glass and modified or functionalized glass, plastics 
(including acrylics, polystyrene and copolymers of slyrene and other materials, 
polypropylene, polyethylene, polybutylene, polyurethanes, TeflonJ, etc.), polysaccharides, 
nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified 
silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the substrates allow 
optical detection and do not appreciably fluoresce. See WO 0055627. 

Generally the substrate is planar, although other configurations of substrates may be 
used as well. For example, the probes may be placed on the inside surface of a tube for 
flow-through sample analysis to minimize sample volume. Similarly, the substrate may be 
flexible, such as a flexible foam, including closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, 
e.g., the biochip is derivatized with a chemical functional group including, but not limited 
to, amino groups, carboxy groups, oxo groups, and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can 
be attached to surfaces comprising ammo groups, e.g., using linkers; e.g., homo-or hetero- 
bifunctional tinkers as are well known (see 1 994 Pierce Chemical Company catalog, 
technical section on cross-linkers, pages 155-200). In addition, in some cases, additional 
linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized, and then attached to the 
surface of the solid support. Either the 5' or 3' terminus may be attached to the solid 
support, or attachment may be via linkage to an internal nucleoside. In another 
embodiment, the immobilization to the solid support may be very strong, yet non-covalent. 
For example, biotinylated oligonucleotides can be made, which bind to surfaces covalently 
coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface. For example, 
photoactivation techniques utilizing photopolymerization compounds and techniques are 
used. In a preferred embodiment, die nucleic acids can be synthesized in situ, using known 
photolithographic techniques, such as those described in WO 95/251 16; WO 95/35505; U.S. 
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Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of which are expressly 
incorporated by reference; these methods of attachment form the basis of the Affymetrix 

GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 
cancer-associated sequences. These assays are typically performed in conjunction with 
reverse transcription. In such assays, a cancer-associated nucleic acid sequence acts as a 
template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In a 
quantitative amplification, the amount of amplification product will be proportional to the 
amount of template in the original sample. Comparison to appropriate controls provides a 
measure of the amount of cancer-associated RNA. Methods of quantitative amplification 
are well known. Detailed protocols for quantitative PCR are provided, e.g., in Innis, et al. 
(1990) PCR Protocols: A Guide to Method s and A pplications Academic Press. 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3 1 end. When the PCR product is amplified in 
subsequent cycles, the 5* nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 
amplification (see, e.g., literature provided by Perkin-Elmer at their public web site). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560-569, Landegren, et al. (1988) 
Science 241:1077-1080, and Barringer, et al. (1990) Gene 89:117-122), transcription 
amplification (Kwoh, et al. (1989) Proc.Natl. Acad. Sci. USA 86:1173-1177), self- 
sustained sequence replication (Guatelli, et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874- 
1878), dot PCR, linker adapter PCR, etc. 
Expression of cancer proteins from nucleic acids 

In a preferred embodiment, cancer nucleic acids, e.g., encoding cancer proteins, are 
used to make a variety of expression vectors to express cancer proteins which can then be 
used in screening assays, as described below. Expression vectors and recombinant DNA 
technology are well known (see, e.g., Ausubel, supra, and Fernandez and Hoeffler (eds. 
1999) Gene Expression Systems Academic Press) to express proteins. The expression 
vectors may be either self-replicating extrachromosomal vectors or vectors which integrate 
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into a host genome. Generally, these expression vectors include transcriptional and 
translational regulatory nucleic acid operably linked to the nucleic acid encoding the cancer 
protein. The term "control sequences" refers to DNA sequences used for the expression of 
an operably linked coding sequence in a particular host organism. Control sequences that 
are suitable for prokaryotes, e.g., include a promoter, optionally an operator sequence, and a 
ribosome binding site. Eukaryotic cells are known to utilize promoters, polyadenylation 
signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship 
with another nucleic acid sequence. For example, DNA for a presequence or secretory 
leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to 
a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the cancer protein. 
Numerous types of appropriate expression vectors and suitable regulatory sequences are 
known for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but 
are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known, and are 
useful in the present invention. 

An expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in 
two organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host 
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for cloning and amplification. Furthermore, for integrating expression vectors, the 
expression vector often contains at least one sequence homologous to the host cell genome, 
and preferably two homologous sequences which flank the expression construct. The 
integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are available. See, e.g., Fernandez and Hoeffler, supra; and Kitamura, et al. (1995) 
Proc. Nafl A cad. Sci. USA Q?Q14fi-Q1 sn 

In addition, in a preferred embodiment, the expression vector contains a selectable 
marker gene to allow the selection of transformed host cells. Selection genes are well 
known and will vary with the host cell used. 

The cancer proteins of the present invention are usually produced by culturing a host 
cell transformed with an expression vector containing nucleic acid encoding a cancer 
protein, under the appropriate conditions to induce or cause expression of the cancer 
protein. Conditions appropriate for cancer protein expression will vary with the choice of 
the expression vector and the host cell, and will be easily ascertained through routine 
experimentation or optimization. For example, the use of constitutive promoters in the 
expression vector will require optimizing the growth and proliferation of the host cell, while 
the use of an inducible promoter requires the appropriate growth conditions for induction. 
In addition, in some embodiments, the timing of the harvest is important For example, the 
baculoviral systems used in insect cell expression are lytic viruses, and thus harvest time 
selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line), and various other human cells and cell lines. 

In a preferred embodiment, the cancer proteins are expressed in mammalian cells. 
Mammalian expression systems may be used, and include retroviral and adenoviral systems. 
One expression vector system is a retroviral vector system such as is generally described in 
PCT/US97/01019 and PCT/US97/01048. Of particular use as mammalian promoters are 
the promoters from mammalian viral genes, since the viral genes are often highly expressed 
and have a broad host range. Examples include the S V40 early promoter, mouse mammary 
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 
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and the CMV promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription 
termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter 
elements, flank the coding sequence. Examples of transcription terminator and 

5 polyadenlyation signals include those derived from SV40. 

Methods of introducing exogenous nucleic acid into mammalian hosts, as well as 
other hosts, are available, and will vary with the host cell used. Techniques include 
dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the 

10 polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

In a preferred embodiment, cancer proteins are expressed in bacterial systems. 
Promoters from bacteriophage may also be used. In addition, synthetic promoters and 
hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp .and lac 
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 

1 5 promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase 
and initiate transcription. In addition to a functioning promoter sequence, an efficient 
ribosome binding site is desirable. The expression vector may also include a signal peptide 
sequence that provides for secretion of the cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, 

20 located between the inner and outer membrane of the cell (gram-negative bacteria). The 
bacterial expression vector may also include a selectable marker gene to allow for the 
selection of bacterial strains that have been transformed. Suitable selection genes include 
genes which render the bacteria resistant to drugs such as ampicillin, chloramphenicol, 
erythromycin, kanamycin, neomycin, and tetracycline. Selectable markers also include 

25 biosynthetic genes, such as those in the histidine, tryptophan, and leucine biosynthetic 
pathways. These components are assembled into expression vectors. Expression vectors 
for bacteria are well known, and include vectors for Bacillus subtilis, E. coli, Streptococcus 
cremoris, and Streptococcus lividans, among others (e.g., Fernandez and Hoeffler, supra). 
The bacterial expression vectors are transformed into bacterial host cells using techniques 

30 such as calcium chloride treatment, electroporation, and others. 

In one embodiment, cancer proteins are produced in insect cells using, e.g., 
expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors. 
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In a preferred embodiment, a cancer protein is produced in yeast cells. Yeast 
expression systems are well known, and include expression vectors for Saccharomyces 
cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, Kluyveromyces 
ftagilis and K. lactis, Pichia guillerimondii and P. pastoris, Schizosaccharomyces pombe, 
and Yarrowia lipolytica. 

The cancer protein may also be made as a fusion protein, using available techniques. 
Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope is small, the 
cancer protein may be fused to a carrier protein to fonn an immunogen. Alternatively, the 
cancer protein may be made as a fusion protein to increase expression, or for other reasons. 
For example, when the cancer protein is a cancer peptide, the nucleic acid encoding the 
peptide may be linked to other nucleic acid for expression purposes. Fusion with detection 
epitope tags can be made, e.g., with FLAG, His6, myc, HA, etc. 

In a preferred embodiment, the cancer protein is purified or isolated after expression. 
Cancer proteins may be isolated or purified in a variety of ways depending on what other 
components are present in the sample and the requirements for purified product, e.g., natural 
conformation or denatured. Standard purification methods include ammonium sulfate 
precipitations, electrophoretic, molecular, immunological, and chromatographic techniques, 
including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, 
and chromatofocusing. For example, the cancer protein may be purified using a standard 
anti-cancer protein antibody column. Ultrafiltration and diafiltration techniques, in 
conjunction with protein concentration, are also useful. See, e.g., Walsh (2002) Proteins: 
Biochemistry and Biotechnolo g y Wiley; Hardin, et al. (eds. 2001) Cloning. Gene 
Expression and Protein Purification Oxford Univ. Press; Wilson, et al. (eds. 2000) 
Encyclopedia of Separation Sripnr.* Academic Press; and Scopes (1993) Protein 
Purification Springer-Verlag. The degree of purification necessary will vary depending on 
the use of the cancer protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the cancer proteins and nucleic acids are 
useful in a number of applications. They may be used as immunoselection reagents, as 
vaccine reagents, as screening agents, therapeutic entities, for production of antibodies, as 
transcription or translation inhibitors, etc. 
Variants of cancer proteins 

Also included within one embodiment of cancer proteins are amino acid variants of 
the naturally occurring sequences, as determined herein. Preferably, the variants are 
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preferably greater than about 75% homologous to the wild-type sequence, more preferably 
greater than about 80%, even more preferably greater than about 85%, and most preferably 
greater than 90%. In some embodiments the homology will be as high as about 93-95% or 
98%. As for nucleic acids, homology in this context means sequence similarity or identity, 
5 with identity being preferred. This homology will be determined using standard techniques, 
as are outlined above for nucleic acid homologies. 

Cancer proteins of the present invention may be shorter or longer than the wild type 
amino acid sequences. Thus, in a preferred embodiment, included within the definition of 
cancer proteins are portions or fragments of the wild type sequences herein. In addition, as 

1 0 outlined above, the cancer nucleic acids of the invention may be used to obtain additional 
coding regions, and thus additional protein sequence. 

In one embodiment, the cancer proteins are derivative or variant cancer proteins as 
compared to the wild-type sequence. That is, as outlined more fully below, the derivative 
cancer peptide will often contain at least one amino acid substitution, deletion, or insertion, 

1 5 with amino acid substitutions being particularly preferred. The amino acid substitution, 
insertion, or deletion may occur at many residue positions within the cancer peptide. 

Also included within one embodiment of cancer proteins of the present invention are 
amino acid sequence variants. These variants typically fall into one or more of three 
classes: substitutional, insertional, or deletional variants. These variants ordinarily are 

20 prepared by site specific mutagenesis of nucleotides in the DNA encoding the cancer 

protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding 
the variant, and thereafter expressing the DNA in recombinant cell culture as outlined 
above. However, variant cancer protein fragments having up to about 100-1 50 residues 
may be prepared by in vitro synthesis using established techniques. Amino acid sequence 

25 variants are characterized by the predetermined nature of the variation, a feature that sets 
them apart from naturally occurring allelic or interspecies variation of the cancer protein 
amino acid sequence. The variants typically exhibit a similar qualitative biological activity 
as a naturally occurring analogue, although variants can also be selected which have 
modified characteristics. 

30 While the site or region for introducing an amino acid sequence variation is often 

predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed cancer variants screened for the 
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optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, e.g., Ml 3 primer 
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of 
cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually will be 
on the order of from about 1-20 amino acids, although considerably larger insertions may be 
tolerated. Deletions generally range from about 1-20 residues, although in some cases 
deletions may be much larger. 

Substitutions, deletions, insertions, or combination thereof may be used to arrive at a 
final derivative. Generally these changes are done on a few amino acids to minimize the 
alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the cancer protein are 
desired, substitutions are generally made in accordance with the amino acid substitution 
relationships described. 

The variants typically exhibit essentially the same qualitative biological activity and 
will elicit the same immune response as a naturally-occurring analog, although variants also 
are selected to modify the characteristics of cancer proteins as needed. Alternatively, the 
variant may be designed such that a biological activity of the cancer protein is altered. For 
example, glycosylation sites may be added, altered, or removed. 

Substantial changes in function or immunological identity are sometimes made by 
selecting substitutions that are less conservative man those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
Substitutions which generally are expected to produce the greatest changes in the 
polypeptide's properties are those in which (a) a hydrophilic residue, e.g., serine or threone 
is substituted for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, 
valine, or alanine; (b) a cysteine or proline is substituted for (or by) another residue; (c) a 
residue having an electropositive side chain, e.g., lysine, arginine, or histidine, is substituted 
for (or by) an electronegative residue, e.g., glutamic or aspartic acid; (d) a residue having a 
bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, 
e.g., glycine; or (e) a proline residue is incorporated or substituted, which changes the 
degree of rotational freedom of the peptidyl bond. 
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Variants typically exhibit a similar qualitative biological activity and will elicit the 
same immune response as the naturally-occurring analog, although variants also are selected 
to modify the characteristics of the skin cancer proteins as needed. Alternatively, the 
variant may be designed such that the biological activity of the cancer protein is altered. 
5 For example, glycosylation sites may be altered or removed. 

Covalent modifications of cancer polypeptides are included within the scope of this 
invention. One type of covalent modification includes reacting targeted amino acid residues 
of a cancer polypeptide with an organic derivatizing agent that is capable of reacting with 
selected side chains or the N-or C-terminal residues of a cancer polypeptide. Derivatization 
10 with bifunctional agents is useful, for instance, for crosslinking cancer polypeptides to a 
water-insoluble support matrix or surface for use in a method for purifying anti-cancer 
polypeptide antibodies or screening assays, as is more fully described below. Commonly 
used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, 
N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional 
15 imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), 
bifunctional maleimides such as bis-N-maleimido-l,8-octane and agents such as methyl-3- 
((p-azidophenyl)ditfao)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to 
the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and 
20 lysine, phosphorylation of hydroxyl groups of serinyl, threonyl, or tyrosyl residues, 

methylation of the amino groups of the lysine, arginine, and histidine side chains (e.g., pp. 
79-86, Creighton (1992) Proteins: Structure and Molecular Properties Freeman), acetylation 
of the N-terminal amine, and amidation of a C-terminal carboxyl group. 

Another type of covalent modification of the cancer polypeptide included within the 
25 scope of this invention comprises altering the native glycosylation pattern of the 

polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to 
mean deleting one or more carbohydrate moieties found in native sequence cancer 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence cancer polypeptide. Glycosylation patterns can be altered in many ways. 
30 Different cell types to express cancer-associated sequences can result in different 
glycosylation patterns. 

Addition of glycosylation sites to cancer polypeptides may also be accomplished by 
altering the amino acid sequence thereof. The alteration may be made, e.g., by the addition 
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of, or substitution by, one or more serine or threonine residues to the native sequence cancer 
polypeptide (for O-linked glycosylation sites). The cancer amino acid sequence may 
optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the cancer polypeptide at preselected bases such that codons are generated that 
will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the cancer 
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. See, 
e.g., WO 87/05330; pp. 259-306 in Aplin and Wriston (1981) CRC Crit. Rev. Rinchsm 

Removal of carbohydrate moieties present on the cancer polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding 
for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are applicable. See, e.g., Sojar and Bahl (1987) Arch. Biochem. Bionhvs. 
259:52-57 and Edge, et al. (1981) Anal. BiocheTti. 1 18:131-137. Enzymatic cleavage of 
carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo-and 
exo-glycosidases. See, e.g., Thotakura, et al. (1987) Meth. Enzvmol. 138:350-359. 

Another type of covalent modification of cancer comprises linking the cancer 
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

Cancer polypeptides of the present invention may also be modified in a way to form 
chimeric molecules comprising a cancer polypeptide fused to another heterologous 
polypeptide or amino acid sequence. In one embodiment, such a chimeric molecule 
comprises a fusion of a cancer polypeptide with a tag polypeptide which provides an 
epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally 
placed at the amino-or carboxyl-terminus of the cancer polypeptide. The presence of such 
epitope-tagged forms of a cancer polypeptide can be detected using an antibody against the 
tag polypeptide. Also, provision of the epitope tag enables the cancer polypeptide to be 
readily purified by affinity purification using an anti-tag antibody or another type of affinity 
matrix that binds to the epitope tag. In an alternative embodiment, the chimeric molecule 
may comprise a fusion of a cancer polypeptide with an immunoglobulin or a particular 
region of an immunoglobulin. For a bivalent form of the chimeric molecule, such a fusion 
could be to the Fc region of an IgG molecule. 
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Various tag polypeptides and their respective antibodies are available. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and 
metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al (1988) 
M ° 1CeU Bio] 8:21 59-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 9E10 
antibodies thereto (Evan, et al. (1985) Molecular ^ G ft ll„w Biology 5:3610-3616); and 
the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al (1990) 
Protein Eng ineering 3(6): 547-553). Other tag polypeptides include the Flag-peptide (Hopp 
et al. (1988) BjoT^mology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) ' 
Science 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J^ioLChem 
266.15163-15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) 
Proc. Natl. Ac ad. Sci. TTSA 87:6393-6397). 

Also included are other cancer proteins of the cancer family, and cancer proteins 
from other organisms, which are cloned and expressed as outlined below. Thus, probe or 
degenerate polymerase chain reaction (PGR) primer sequences may be used to find other 
related cancer proteins from humans or other organisms. Particularly useful probe and/or 
PCR primer sequences include the unique areas of the cancer nucleic acid sequence 
Preferred PCR primers are from about 15-35 nucleotides in length, with from about 20-30 
bemg preferred, and may contain inosine as needed. The conditions for PCR reaction have 
been well described (e.g., Innis, PCR Protocols, supra). 

In addition, cancer proteins can be made that are longer than those encoded by the 
nucleic acids of Table 2 or the attached listing of SEQ ID NOs: 1-58, e.g., by the elucidation 
of extended sequences, the addition of epitope or purification tags, the addition of other 
fusion sequences, etc. 

Cancer proteins may also be identified as being encoded by cancer nucleic acids 
Thus, cancer proteins are encoded by nucleic acids that will hybridize to the sequences of 
the sequence listings, or their complements, as outlined herein. 
Antibodies to cancer proteins 

In a preferred embodiment, when the cancer protein is to be used to generate 
antibodies, e.g., for immunotherapy or immunodiagnosis, the cancer protein should share at 
least one epnope or determinant with the foil length protein. By "epitope" or "determinant" 
herem is typically meant a portion of a protein which will generate and/or bind an antibody 
or T-cell receptor in the context of MHC. Thus, in most instances, antibodies made to a 
smaller cancer protein will be able to bind to the foil-length protein, particularly linear 
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epitopes. In a preferred embodiment, the epitope is unique; that is, antibodies generated to a 
unique epitope show little or no cross-reactivity. In a preferred embodiment, the epitope is 
selected from a protein sequence set out in the Table 2 or the attached listing of SEQ ID 
NOs:59-116. 

5 Methods of preparing polyclonal antibodies exist (e.g., Coligan, supra; and Harlow 

and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or more 
injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing 
( agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or 
intraperitoneal injections. The immunizing agent may include a protein encoded by a 
10 nucleic acid of Table 2 or SEQ ID NOs: 1-58 or fragment thereof or a fusion protein thereof. 
It may be useful to conjugate the immunizing agent to a protein known to be immunogenic 
in the mammal being immunized. Examples of such immunogenic proteins include but are 
not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and 
soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
1 5 complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). Various immunization protocols may be used. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler 
and Milstein (1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other 
20 appropriate host animal, is typically immunized with an immunizing agent to elicit 

lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Table 2 
or the attached listing of SEQ ID NOs:l-58, or fragment thereof, or a fusion protein thereof. 
25 Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin 
are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (e.g., pp. 59-103 in 
Goding (1986) Monoclonal Antibodies: Principles and Practice Academic Press). 
30 Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine, or human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells may be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the 
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unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies 
are monoclonal, preferably human or humanized, antibodies that have binding specificities 
for at least two different antigens or that have binding specificities for two epitopes on the 
same antigen. In one embodiment, one of the binding specificities is for a protein encoded 
by a nucleic acid of Table 2 or the attached listing of SEQ ID NOs:l-58, or a fragment 
thereof, the other one is for another antigen, and preferably for a cell-surface protein or 
receptor or receptor subunit, preferably one that is tumor specific. Alternatively, tetramer- 
type technology may create multivalent reagents. 

In a preferred embodiment, the antibodies to cancer protein are capable of reducing 
or eliminating a biological function of a cancer protein, in a naked form or conjugated to an 
effector moiety, as is described below. That is, the addition of anti-cancer protein 
antibodies (either polyclonal or preferably monoclonal) to cancer tissue (or cells containing 
cancer) may reduce or eliminate the cancer. Generally, at least a 25% decrease in activity, 
growth, size, or the like is preferred, with at least about 50% being particularly preferred 
and about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the cancer proteins are humanized 
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, 
Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies include human 
immunoglobulins (recipient antibody) in which residues from a complementary determining 
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 
(donor antibody) such as mouse, rat, or rabbit having the desired specificity, affinity, and 
capacity. In some instances, Fv framework residues of a human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise 
residues which are found neither in the recipient antibody nor in the imported CDR or 
framework sequences. In general, a humanized antibody will comprise substantially all of 
at least one, and typically two, variable domains, in which all or substantially all of die CDR 
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regions correspond to those of a non-human immunoglobulin and all or substantially all of 
the framework (FR) regions are those of a human immunoglobulin consensus sequence. 
The humanized antibody optimally also will typically comprise at least a portion of an 
immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et 

5 al. (1986) Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-329; and Presta 
(1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be essentially performed 
following the method of Winter and co-workers (Jones, et al. (1986) Nature 321 :522-525; 
Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239: 1534- 
1536), by substituting rodent CDRs or CDR sequences for corresponding sequences of a 

10 human antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. 
Patent No. 4,816,567), wherein substantially less than an intact human variable domain has 
been substituted by corresponding sequence from a non-human species. 

Human antibodies can also be produced using phage display libraries (Hoogenboom 
and Winter (1992) J. Mol. Biol. 227:381-388; Marks, et al. (1991) J. Mol. Biol. 222:581- 

15 597) or human monoclonal antibodies (e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) 
Monoclonal Antibodies and Cancer Therapy Liss; and Boerner, et al. (1991) J. Immunol. 
147:86-95). Similarly, human antibodies can be made by introducing human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

20 human antibody production is observed, which closely resembles that seen in humans in 
nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 

25 Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-851; Neuberger 
(1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. 
Immunol. 13:65-93. 

By immunotherapy is meant treatment of cancer with an antibody raised against 
cancer proteins. As used herein, immunotherapy can be passive or active. Passive 

30 immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient). 
Active immunization is the induction of antibody and/or T-cell responses in a recipient 
(patient). Induction of an immune response is the result of providing the recipient with an 
antigen to which antibodies are raised. The antigen may be provided by injecting a 
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polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 
expression of the antigen, leading to an immune response. 

In a preferred embodiment the cancer proteins against which antibodies are raised 
are secreted proteins as described above. Without being bound by theory, antibodies used 
for treatment may bind and prevent the secreted protein from binding to its receptor, thereby 
inactivating the secreted cancer protein, e.g., in autocrine signaling. 

In another preferred embodiment, the cancer protein to which antibodies are raised is 
a transmembrane protein. Without being bound by theory, antibodies used for treatment 
may bind the extracellular domain of the cancer protein and prevent it from binding to other 
proteins, such as circulating ligands or cell-associated molecules. The antibody may cause 
down-regulation of the transmembrane cancer protein. The antibody may be a competitive, 
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of 
the cancer protein. The antibody may also be an antagonist of the cancer protein. Further, 
the antibody may prevent activation of the transmembrane cancer protein, or may induce or 
suppress a particular cellular pathway. In one aspect, when the antibody prevents the 
binding of other molecules to the cancer protein, the antibody prevents growth of the cell. 
The antibody may also be used to target or sensitize the cell to cytotoxic agents, including, 
but not limited to TNF-cc, TNF-p\ IL-1, INF- Y , and IL-2, or chemotherapeutic agents 
including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, and the like. In some 
instances the antibody may belong to a sub-type that activates serum complement when 
complexed with the transmembrane protein thereby mediating cytotoxicity or antigen- 
dependent cytotoxicity (ADCC). Thus, cancer may be treated by administering to a patient 
antibodies directed against the transmembrane cancer protein. Antibody-labeling may 
activate a co-toxin, localize a toxin payload, target a drug loaded liposome, or otherwise 
provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be various molecules, including labeling moieties such as 
radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the 
therapeutic moiety is a small molecule that modulates the activity of a cancer protein. In 
another aspect the therapeutic moiety may modulate the activity of molecules associated 
with or in close proximity to a cancer protein. The therapeutic moiety may inhibit 
enzymatic or signaling activity such as protease or collagenase or protein kinase activity 
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associated with cancer, or be an attractant of other cells, such as NK cells. See, e.g., USSN 
09/544,494. - 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to cancer tissue or cells results in a reduction in 
the number of afflicted cells, thereby reducing symptoms associated with cancer. Cytotoxic 
agents are numerous and varied and include, but are not limited to, cytotoxic drugs or toxins 
or active fragments of such toxins. Suitable toxins and their corresponding fragments 
include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, crotin, 
phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached 
to the antibody. Targeting the therapeutic moiety to transmembrane cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the cancer afflicted 
area, but also serves to reduce deleterious side effects that may be associated with the 
untargeted therapeutic moiety. Antibody fragments may be used to target toxin loaded 
liposomes. 

In another preferred embodiment, the cancer protein against which the antibodies are 
raised is an intracellular protein. In this case, the antibody may be conjugated to a protein 
which facilitates entry into the cell. In one case, the antibody enters the cell by endocytosis. 
In another embodiment, a nucleic acid encoding the antibody is administered to the 
individual or cell. Moreover, wherein the cancer protein can be targeted within a cell, e.g., 
the nucleus, an antibody thereto may contain a signal for that target localization, e.g., a 
nuclear localization signal. 

The cancer antibodies of the invention specifically bind to cancer proteins. By 
"specifically bind" herein is meant that the antibodies bind to the protein with a of at 
least about 0.1 mM, more usually at least about 1 /xM, preferably at least about 0.1 (M. or 
better, and most preferably, 0.01 or better. Selectivity of binding to the specific target 
and not to related sequences is often also important. 
Detection of cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the cancer phenotype. Expression levels of genes in normal tissue (e.g., 
not undergoing cancer) and in cancer tissue (and in some cases, for varying severities of 
cancer that relate to prognosis, as outlined below), or in non-malignant disease are evaluated 
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to provide expression profiles. A gene expression profile of a particular cell state or point 
of development is essentially a "fingerprint" of the state of the cell. While two states may 
have a particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the 
5 state of the cell. By comparing expression profiles of cells in different states, information 
regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 
tissue. This will provide for molecular diagnosis of related conditions. 

10 "Differential expression," or grammatical equivalents as used herein, refers to 

qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus cancer tissue. Genes may be turned on or turned off in a particular state, 

15 relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, 
but not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; e.g., gene expression is either upregulated, resulting in 

20 an increased amount of transcript, or downregulated, resulting in a decreased amount of 

transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays. See, Lockhart (1996) Nature Biotechnology 14:1675-1680. 
Other techniques include, but are not limited to, quantitative reverse transcriptase PCR, 

25 northern analysis, and RNase protection. As outlined above, preferably the change in 

expression (e.g., upregulation or downregulation) is at least about 50%, more preferably at 
least about 100%, more preferably at least about 150%, more preferably at least about 
200%, wife from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript or the protein level. The amount of gene 

30 expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of 
the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
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spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to cancer 
genes, e.g., those identified as being important in a cancer or disease phenotype, can be 
evaluated in a cancer diagnostic test. In a preferred embodiment, gene expression 
monitoring is performed simultaneously on a number of genes. Multiple protein expression 
monitoring can be performed as well. 

In this embodiment, the cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of cancer sequences in a particular cell. 
The assays are further described below in the example. PCR techniques can be used to 
provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the cancer protein are detected. 
Although DNA or RNA encoding the cancer protein may be detected, of particular interest 
are methods wherein an mRNA encoding a cancer protein is detected. Probes to detect 
mRNA can be a nucleotide/deoxynucleotide probe that is complementary to and hybridizes 
with the mRNA and includes, but is not limited to, oligonucleotides, cDNA, or RNA. 
Probes also should contain a detectable label, as defined herein. In one method the mRNA 
is detected after immobilizing the nucleic acid to be examined on a solid support such as 
nylon membranes and hybridizing the probe with the sample. Following washing to remove 
the non-specifically bound probe, the label is detected. In another method, detection of the 
mRNA is performed in situ. In this method permeabilized cells or tissue samples are 
contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe 
to hybridize with the target mRNA. Following washing to remove the non-specifically 
bound probe, the label is detected. For example a digoxygenin labeled riboprobe (RNA 
probe) that is complementary to the mRNA encoding a cancer protein is detected by binding 
the digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue 
tetrazolium and 5-bromo-4-chloro-3-inddyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane, or intracellular proteins) are used in diagnostic 
assays. The cancer proteins, antibodies, nucleic acids, modified proteins, and cells 
containing cancer sequences are used in diagnostic assays. This can be performed on an 
individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 
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As described and defined herein, cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markers of cancer, e.g., for prognostic or 
diagnostic purposes. Detection of these proteins in putative cancer tissue allows for 
detection, prognosis, or diagnosis of cancer or similar disease, and for selection of 
therapeutic strategy. In one embodiment, antibodies are used to detect cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the cancer protein is detected, 
e.g., by immunoblotting with antibodies raised against the cancer protein. 

In another preferred method, antibodies to the cancer protein find use in in situ 
imaging techniques, e.g., in histology. See, e.g., Asai, et al. (eds. 1993) Methods in Cell 
Biology: Antibodies in Cell Biology (vol. 37) Academic Press. In this method, cells are 
contacted with from one to many antibodies to the cancer protein(s). Following washing to 
remove non-specific antibody binding, the presence of the antibody or antibodies is 
detected. In one embodiment the antibody is detected by incubating with a secondary 
antibody that contains a detectable label. In another method the primary antibody to the 
cancer protein(s) contains a detectable label, e.g., an enzyme marker that can act on a 
substrate. In another preferred embodiment each one of multiple primary antibodies 
contains a distinct and detectable label. This method finds particular use in simultaneous 
screening for a plurality of cancer proteins. Many other histological imaging techniques are 
also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing cancer from 
blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as 
samples to be probed or tested for the presence of cancer proteins. Antibodies can be used 
to detect a cancer protein by previously described immunoassay techniques including 
ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE technology 
and the like. Conversely, the presence of antibodies may indicate an immune response 
against an endogenous cancer protein. 

In a preferred embodiment, in situ hybridization of labeled cancer nucleic acid 
probes to tissue arrays is done. For example, arrays of tissue samples, including cancer 
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tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is 
then performed. When comparing the fingerprints between an individual and a standard, a 
diagnosis, a prognosis, or a prediction may be based on the findings. It is further 
understood that the genes which indicate the diagnosis may differ from those which indicate 
the prognosis and molecular profiling of the condition of the cells may lead to distinctions 
between responsive or refractory conditions or may be predictive of outcomes. 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins, and cells containing cancer sequences are used in prognosis assays. As above, 
gene expression profiles can be generated that correlate to cancer, clinical, pathological, or 
other information, in terms of long term prognosis. Again, this may be done on either a 
protein or gene level, with the use of genes being preferred. Single or multiple genes may 
be useful in various combinations. As above, cancer probes may be attached to biochips for 
the detection and quantification of cancer sequences in a tissue or patient. The assays 
proceed as outlined above for diagnosis. PCR method may provide more sensitive and 
accurate quantification. 
Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The cancer proteins, antibodies, nucleic acids, 
modified proteins, and cells containing cancer sequences are used in drug screening assays 
or by evaluating the effect of drug candidates on a "gene expression profile" or expression 
profile of polypeptides. In a preferred embodiment, the expression profiles are used, 
preferably in conjunction with high throughput screening techniques, to allow monitoring 
for expression profile genes after treatment with a candidate agent (e.g., Zlokarnik, et al. 
(1998) Science 279:84-88; Heid (1996) Genome Res. 6:986-994. 

In a preferred embodiment, the cancer proteins, antibodies, nucleic acids, modified 
proteins and cells containing the native or modified cancer proteins are used in screening 
assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the cancer phenotype or an identified physiological function 
of a cancer protein. As above, this can be done on an individual gene level or by evaluating 
the effect of drug candidates on a n gene expression profile". In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques, to allow monitoring for expression profile genes after treatment with a candidate 
agent, see Zlokarnik, supra. 
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Having identified the differentially expressed genes herein, a variety of assays may 
be performed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
5 cancer protein. "Modulation" thus includes both an increase and a decrease in gene 

expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing cancer, with changes of at least 10%, 
preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in cancer tissue compared to normal 

10 tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in cancer 
tissue compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 

1 5 monitored, e.g., through the use of antibodies to the cancer protein and standard 

immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will 
20 typically involve a plurality of those entities described herein. 

In this embodiment, the cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of cancer sequences in a particular cell. 
Alternatively, PGR may be used. Thus, a series, e.g., of microtiter plate, may be used with 
dispensed primers in desired wells. A PCR reaction can then be performed and analyzed for 
25 each well. 

Modulators of cancer 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more cancer-associated sequences, e.g., a polynucleotide sequence set 
30 out in Table 2 or SEQ ID NOs: 1-58. Generally, in a preferred embodiment, a test 

modulator is added to the cells prior to analysis. Moreover, screens are also provided to 
identify agents that modulate cancer, modulate cancer proteins, bind to a cancer protein, or 
interfere with the binding of a cancer protein and an antibody or other binding partner. 
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The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the cancer phenotype or the expression of a cancer sequence, e.g., a nucleic 
acid or protein sequence. In preferred embodiments, modulators alter expression profiles, 
or expression profile nucleic acids or proteins provided herein. In one embodiment, the 
modulator suppresses a cancer phenotype, e.g., to a normal or non-malignant tissue 
fingerprint. In another embodiment, a modulator induced a cancer phenotype. Generally, a 
plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
differential response to the various concentrations. Typically, one of these concentrations 
serves as a negative control, e.g., at zero concentration or below the level of detection. 

Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 
than 100 and less than about 2,500 daltons. Preferred small molecules are less than 2000, or 
less than 1500, or less than 1000, or less than 500 D. Candidate agents comprise functional 
groups necessary for structural interaction with proteins, particularly hydrogen bonding, and 
typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least 
two of the functional chemical groups. The candidate agents often comprise cyclical carbon 
or heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or 
more of the above functional groups. Candidate agents are also found among biomolecules 
including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, 
structural analogs, or combinations thereof. Particularly preferred are peptides. 

In one aspect, a modulator will neutralize the effect of a cancer protein. By 
"neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a cancer polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. See, e.g., Janzen (2002) High Throughput Screening: 
Methods and Protocols Humana; Devlin (ed. 1997) High Throughput Screening: The 
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Discovery of Bioactive Substances Dekker; and Mei and Czarnik (eds. 2002) Integrated 
Drug Discovery Techniques Dekker. 

In one preferred embodiment, high throughput screening methods involve providing 
a library containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual 
therapeutics. 

10 A combinatorial chemical library is a collection of diverse chemical compounds 

generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 

15 length (e.g., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop, et al. 0994^1 J. Med. Chem. 37:1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known. Such 
combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., 

20 U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487-493, Houghton, et al. 

(1991) Nature 354:84-88), peptoids (PCT Publication No WO 91/19735), encoded peptides 
(PCT Publication WO 93/20242), random bio-oligomers (PCT Publication WO 92/00091), 
benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as hydantoins, 
benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6909- 

25 6913, vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 1 14:6568-570), 
nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et al. (1992) 
J. Amer. Chem. Soc. 1 14:9217-9218), analogous organic syntheses of small compound 
libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 1 16:2661-662), oligocarbamates (Cho, et 
al. (1993) Science 261:1303-1305), and/or peptidyl phosphonates (Campbell, et al. (1994) I 

30 Org. Chem. 59:658). See, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385-1401, 
nucleic acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid libraries (see, e.g., 
U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et al. (1996) Nature 
Biotechnology 14(3):309-314, and PCT/US96/10287), carbohydrate libraries (see, e.g., 
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Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, page 33 Baum (Jan 18, 1993) C&EN; 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent 
No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino 
5 compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and 
the like). 

Devices for the preparation of combinatorial libraries are commercially available 
(see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, 
Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, 
10 MA). 

A number of well known robotic systems have also been developed for solution 
phase chemistries. These systems include automated workstations like the automated 
synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and 
many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, 

1 5 Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic manual synthetic operations 
performed by a chemist. The above devices are suitable for use with the present invention. 
The nature and implementation of modifications to these devices (if any) so that they can 
operate as discussed herein will be apparent. In addition, numerous combinatorial libraries 
are themselves commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, 

20 Moscow, Ru, Tripos, Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D 
Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of cancer gene transcription, 
inhibition, or enhancement of polypeptide expression, and inhibition or enhancement of 

25 polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other properties 
of particular nucleic acids or protein products are well known. Similarly, binding assays 
and reporter gene assays are similarly well known. Thus, e.g., U.S. Patent No. 5,559,410 
discloses high throughput screening methods for proteins, U.S. Patent No. 5,585,639 

30 discloses high throughput screening methods for nucleic acid binding (e.g., in arrays), while 
U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of screening 
for ligand/antibody binding. 
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In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, M^ etc.). These systems 
typically automate entire procedures, including sample and reagent pipetting, liquid 
5 dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as 
well as a high degree of flexibility and customization. The manufacturers of such systems 
provide detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
1 0 gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, 
or random or directed digests of proteinaceous cellular extracts, may be used. In this way' 
libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Particularly useful test compound will be directed to the class of proteins to 
which the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids 
with from about 5-20 amino acids being preferred, and from about 7-15 being particularly ' 
preferred. The peptides may be digests of naturally occurring proteins, random peptides or 
Wd» random peptides. By "randomized" or grammatical equivalents herein is meant 
that each nucleic acid and peptide consists of essentially random nucleotides and amino 
acids, respectively. Since generally these random peptides (or nucleic acids, discussed 
below) are chemically synthesized, they may incorporate a nucleotide or amino acid at any 
position. The synthetic process can be designed to generate randomized proteins or nucleic 
acids, to allow the formation of all or most of the possible combinations over the length of 
the sequence, thus forming a library of randomized candidate bioactive proteinaceous 
agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is some 
portions within the sequence are either held constant, or are selected from a limited number 
of possibilities. For example, in a preferred embodiment, the nucleotides or amino acid 
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residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation 
of nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for 
SH-3 domains, serines, threonines, tyrosines, orhistidines for phosphorylation sites, etc., or 
to purines, etc. 

Modulators of cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of prokaryotic or eukaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, 
a wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
or biotin which specifically binds to streptavidin. For the example of biotin, the stxeptavidin 
is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

These assays can be direct hybridization assays or can comprise "sandwich assays", 
which include the use of multiple probes, as is generally outlined in U.S. Patent Nos. 
5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 
5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246, and 5,681,697, all of 
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which are hereby incorporated by reference. In this embodiment, in general, the target 
nucleic acid is prepared as outlined above, and then added to the biochip comprising a 
plurality of nucleic acid probes, under conditions that allow the formation of a hybridization 
complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate, and low stringency conditions as outlined above. The assays are generally 
run under stringency conditions which allows formation of the label probe hybridization 
complex only in the presence of target. Stringency can be controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to, temperature 
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outhned in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different 
orders, with preferred embodiments outlined below. In addition, the reaction may include a 
vanety of other reagents. These include salts, buffers, neutral proteins, e.g., albumin 
detergents, etc. which may be used to facilitate optimal hybridization and detection, and/or 
reduce non-specific or background interactions. Reagents that otherwise improve the 
efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial 
agents, etc., may also be used as appropriate, depending on the sample preparation methods 
and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In 
another embodiment, e.g., for diagnostic applications, having identified differentially 
expressed genes important in a particular state, screens can be performed to identify 
modulators that alter expression of individual genes. In an another embodiment, screening 
is performed to identify modulators that alter a biological function of the expression product 
of a differentially expressed gene. Again, having identified the importance of a gene in a 
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particular state, screens are performed to identify agents that bind and/or modulate the 
biological activity of the gene product. 

In addition, screens can be done for genes that are induced in response to a candidate 
agent or treatment process. After identifying a modulator based upon its ability to suppress 

5 a cancer expression pattern leading to a normal expression pattern (or its converse), or to 
modulate a single cancer gene expression profile so as to mimic the expression of the gene 
from normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated cancer tissue reveals genes that are not expressed in normal 

10 tissue or cancer tissue, but are expressed in agent treated tissue. These agent-specific 
sequences can be identified and used by methods described herein for cancer genes or 
proteins. In particular, these sequences and the proteins they encode find use in marking or 
identifying agent treated cells. In addition, antibodies can be raised against the agent 
induced proteins and used to target novel therapeutics, e.g., toxin loaded liposomes, to the 

1 5 treated cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of cancer 
cells that have an associated cancer expression profile. By "administration" or "contacting" 
herein is meant that the candidate agent is added to the cells in such a manner as to allow 
the agent to act upon the cell, whether by uptake and intracellular action, or by action at the 

20 cell surface. In some embodiments, nucleic acid encoding a proteinaceous candidate agent 
(e.g., a peptide) may be put into a viral construct such as an adenoviral or retroviral 
construct, and added to the cell, such that expression of the peptide agent is accomplished, 
e.g., PCT US97/01019. Regulatable gene therapy systems can also be used. 

Once a test compound has been administered to the cells, the cells can be washed if 

25 desked and are allowed to incubate under preferably physiological conditions for some 

period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., cancer or non-malignant tissue may be screened for agents that modulate, 
e.g., induce or suppress a cancer phenotype. A change in at least one gene, preferably 
30 many, of the expression profile indicates that the agent has an effect on cancer activity. By 
defining such a signature for the cancer phenotype, screens for new drugs that alter the 
phenotype can be devised. With this approach, the drug target need not be known and need 
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not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on individual 
genes and gene products (proteins). That is, having identified a particular differentially 
5 expressed gene as important in a particular state, screening of modulators of either the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "cancer proteins" or a 
"cancer modulatory protein". The cancer modulatory protein may be a fragment, or 
alternatively, be the full length protein to the fragment encoded by the nucleic acids of 

10 Table 2 or SEQ ID NOs: 1 -58. Preferably, the cancer modulatory protein is a fragment In a 
preferred embodiment, the cancer amino acid sequence which is used to determine sequence 
identity or similarity is encoded by a nucleic acid of the Table 2 or SEQ ID NOs:l-58. In 
another embodiment, the sequences are naturally occurring allelic variants of a protein 
encoded by a nucleic acid of the Table 2 or SEQ ID NOs: 1-5 8. In another embodiment, the 

15 sequences are sequence variants as further described herein. 

Preferably, the cancer modulatory protein is a fragment of about 14-24 amino acids 
long. More preferably the fragment is a soluble fragment. Preferably, the fragment 
includes a non-transmembrane region. In a preferred embodiment, the fragment has an N- 
terminal Cys to aid in solubility. In one embodiment, the C-terminus of the fragment is kept 

20 as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to cysteine. 

In one embodiment the cancer proteins are conjugated to an immunogenic agent as 
discussed herein. In one embodiment the cancer protein is conjugated to BSA. 

Measurements of cancer polypeptide activity, or of cancer or the cancer phenotype 
can be performed using a variety of assays. For example, the effects of the test compounds 

25 upon the function of the cancer polypeptides can be measured by examining parameters 

described above. A suitable physiological change that affects activity can be used to assess 
the influence of a test compound on the polypeptides of this invention. When the functional 
consequences are determined using intact cells or animals, one can also measure a variety of 
effects such as, in the case of cancer associated with tumors, tumor growth, tumor 

30 metastasis, neovascularization, hormone release, transcriptional changes to both known and 
uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 
cell growth or pH changes, and changes in intracellular second messengers such as cGMP. 
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In the assays of the invention, mammalian cancer polypeptide is typically used, e.g., mouse, 
preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a cancer polypeptide is first contacted with a potential modulator and 

5 incubated for a suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the 
cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is typically measured using immunoassays such as western 
blotting, ELIS A, and the like with an antibody that selectively binds to the cancer 
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 

10 PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 

blotting, are preferred. The level of protein or mRNA is typically detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, 
radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using a cancer protein promoter 

1 5 operably linked to a reporter gene such as luciferase, green fluorescent protein, CAT, or j3- 
gal. The reporter construct is typically transfected into a cell. After treatment with a 
potential modulator, the amount of reporter gene transcription, translation, or activity is 
measured according to standard techniques. 

In a preferred embodiment, as outlined above, screens may be done on individual 

20 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression 
of the gene or the gene product itself can be done. The gene products of differentially 
expressed genes are sometimes referred to herein as "cancer proteins." The cancer protein 
may be a fragment, or alternatively, the full length protein to a fragment shown herein. 

25 In one embodiment, screening for modulators of expression of specific genes is 

performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are 

30 identified, variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed 
nucleic acids are made. For example, antibodies are generated to the protein gene products, 
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and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a cancer protein 
and a candidate compound, and determining the binding of the compound to the cancer 
5 protein. Preferred embodiments utilize the human cancer protein, although other 
mammalian proteins may also be used, e.g., for the development of animal models of 
human disease. In some embodiments, as outlined herein, variant or derivative cancer 
proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the cancer protein or 

10 the candidate agent is non-diffusably bound to an insoluble support, preferably having 
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble 
supports may be made of a composition to which the compositions can be bound, is readily 
separated from soluble material, and is otherwise compatible with the overall method of 
screening. The surface of such supports may be solid or porous and of a convenient shape. 

15 Examples of suitable insoluble supports include microtiter plates, arrays, membranes, and 
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon 
or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient 
because a large number of assays can be carried out simultaneously, using small amounts of 
reagents and samples. The particular manner of binding of the composition is typically not 

20 crucial so long as it is compatible with the reagents and overall methods of the invention, 
maintains the activity of the composition, and is nondiffusable. Preferred methods of 
binding include the use of antibodies (which do not sterically block either the ligand binding 
site or activation sequence when the protein is bound to the support), direct binding to 
"sticky" or ionic supports, chemical crosslinking, the synthesis of the protein or agent on the 

25 surface, etc. Following binding of the protein or agent, excess unbound material is removed 
by washing. The sample receiving areas may then be blocked through incubation with 
bovine serum albumin (BSA), casein, or other innocuous protein or other moiety. 

In a preferred embodiment, the cancer protein is bound to the support, and a test 
compound is added to the assay. Alternatively, the candidate agent is bound to the support 

30 and the cancer protein is added. Novel binding agents include specific antibodies, non- 
natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of 
particular interest are screening assays for agents that have a low toxicity for human cells. 
A wide variety of assays may be used for this purpose, including labeled in vitro protein- 
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protein binding assays, electrophoretic mobility shift assays, immunoassays for protein 
binding, functional assays (phosphorylation assays, etc.), and the like. 

The determination of the binding of the test modulating compound to the cancer 
protein may be done in a number of ways. In a preferred embodiment, the compound is 
5 labeled, and binding determined directly, e.g., by attaching all or a portion of the cancer 
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), 
washing off excess reagent, and determining whether the label is present on the solid 
support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the proteins (or 
10 proteinaceous candidate compounds) can be labeled. Alternatively, more than one 

component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by competitive 

1 5 binding assay. The competitor may be a binding moiety known to bind to the target 

molecule (e.g., a cancer protein), such as an antibody, peptide, binding partner, ligand, etc. 
Under certain circumstances, there may be competitive binding between the compound and 
the binding moiety, with the binding moiety displacing the compound. In one embodiment, 
the test compound is labeled. Either the compound, or the competitor, or both, is added first 

20 to the protein for a time sufficient to allow binding, if present. Incubations may be 

performed at a temperature which facilitates optimal activity, typically between about 4-40° 
C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
screening. Typically between 0.1-1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or 

25 absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by a test 
compound. Displacement of the competitor is an indication that the test compound is 
binding to the cancer protein and thus is capable of binding to, and potentially modulating, 
the activity of the cancer protein. In this embodiment, either component can be labeled. 

30 Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of 
the label on the support indicates displacement. 
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In an alternative embodiment, the test compound is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may 
indicate that the test compound is bound to the cancer protein with a higher affinity. Thus, 
if the test compound is labeled, the presence of the label on the support, coupled with a lack 
5 of competitor binding, may indicate that the test compound is capable of binding to the 
cancer protein. 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the cancer proteins. In one 
embodiment, the methods comprise combining a cancer protein and a competitor in a first 

10 sample. A second sample comprises a test compound, a cancer protein, and a competitor. 
The binding of the competitor is determined for both samples, and a change, or difference in 
binding between the two samples indicates the presence of an agent capable of binding to 
the cancer protein and potentially modulating its activity. That is, if the binding of the 
competitor is different in the second sample relative to the first sample, the agent is capable 

15 of binding to the cancer protein. 

Alternatively, differential screening is used to identify drug candidates that bind to 
the native cancer protein, but cannot bind to modified cancer proteins. The structure of the 
cancer protein may be modeled, and used in rational drug design to synthesize agents that 
interact with that site. Drug candidates that affect the activity of a cancer protein are also 

20 identified by screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 

25 protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine 
the amount of bound compound. 

A variety of other reagents may be included in the screening assays. These include 

30 reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used to 
facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
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protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The 
mixture of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a cancer protein. The methods comprise 
5 adding a test compound, as defined above, to a cell comprising cancer proteins. Preferred 
cell types include almost any cell The cells contain a recombinant nucleic acid that 
encodes a cancer protein. In a preferred embodiment, a library of candidate agents are tested 
on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or 
10 subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In 
another example, the determinations are determined at different stages of the cell cycle 
process. 

15 In this way, compounds that modulate cancer agents are identified. Compounds 

with pharmacological activity are able to enhance or interfere with the activity of the cancer 
protein. Once identified, similar structures are evaluated to identify critical structural 
feature of the compound. 

In one embodiment, a method of inhibiting cancer cell division is provided. The 
20 method comprises administration of a cancer inhibitor. In another embodiment, a method of 
inhibiting cancer is provided. The method may comprise administration of a cancer 
inhibitor. In a further embodiment, methods of treating cells or individuals with cancer are 
provided, e.g., comprising administration of a cancer inhibitor. 

In one embodiment, a cancer inhibitor is an antibody as discussed above. In another 
25 embodiment, the cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, viability, and metastasis assays are available, 
as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
30 transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
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grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or 
eliminate the host cells' ability to grow in stirred suspension culture or suspended in semi- 
5 solid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described, e.g., in Freshney (1998) Culture of Animal Cells: A Manual of Basic Technique 
(3d ed.) Wiley-Liss; Freshney (2000) Culture of Animal Cells: A Manual of Basic 
Technique (4th ed.) Wiley-Liss; and Garkavtsev, et al. (1996) Nature Genet. 14:415-20. 

1 0 Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to 

1 5 a higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney 
(2000), supra. The transformed cells, when transfected with tumor suppressor genes, 

20 regenerate a normal phenotype and become contact inhibited and would grow to a lower 
density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 

method of measuring density limitation of growth. Transformed host cells are transfected 

with a cancer-associated sequence and are grown for 24 hours at saturation density in non- 
25 limiting medium conditions. The percentage of cells labeling with (3H)-thymidine is 

determined autoradiographically. See, Freshney (1998), supra. 

Growth factor or serum dependence^ 

Transformed cells typically have a lower serum dependence than their normal 

counterparts (see, e.g., Temin (1966) J.Natl. Cancer Insti. 37:167-175; Eagle, et al.(1970) I 
30 Exp. Med. 131:836-879); Freshney, supra. This is in part due to release of various growth 

factors by the transformed cells. Growth factor or serum dependence of transformed host 

cells can be compared with that of control. 
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Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers' 1 ) than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 
5 Gullino "Angiogenesis, tumor vascularization, and potential interference with tumor 
growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. 
Similarly, tumor angiogenesis factor (TAF) is released at a higher level in tumor cells than 
their normal counterparts. See, e.g., Folkman (1992) Sem. Cancer Biol. 3:89-96. 

Various techniques which measure the release of these factors are described in 
10 Freshney (1998), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 
Strickland and Beers (1976) J. Biol. Chem. 251 :5694-5702; Whur, et al. (1980) Br. J. 
Cancer 42:305-312; Gullino "Angiogenesis, tumor vascularization, and potential 
interference with tumor growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in 
Cancer Plenum: Freshney (1985) Anticancer Res. 5:11 1-130. 
1 5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate cancer-associated 
sequences. Tumor cells exhibit a good correlation between malignancy and invasiveness of 
cells into Matrigel or some other extracellular matrix constituent. In this assay, tumorigenic 
20 cells are typically used as host cells. Expression of a tumor suppressor gene in these host 
cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 
invasion of host cells can be measured by using filters coated with Matrigel or some other 
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 
25 filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side 
of the filter or bottom of the dish. See, e.g., Freshney (1 984), supra. 
Tumor growth in vivo 

Effects of cancer-associated sequences on cell growth can be tested in transgenic or 
30 immune-suppressed mice. Knock-out transgenic mice can be made, in which the cancer 
gene is disrupted or in which a cancer gene is inserted. Knock-out transgenic mice can be 
made by insertion of a marker gene or other heterologous gene into the endogenous cancer 
gene site in the mouse genome via homologous recombination. Such mice can also be 
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made by substituting the endogenous cancer gene with a mutated version of the cancer gene, 
or by mutating the endogenous cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, 
5 which is re-implanted into a recipient female. Some of these embryos develop into chimeric 
mice that possess germ cells partially derived from the mutant cell line. Therefore, by 
breeding the chimeric mice it is possible to obtain a new line of mice containing the 
introduced genetic lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288-1292). 
Chimeric targeted mice can be derived according to Hogan, et al. (1988) Manipulating the 
10 Mouse Embryo: A Laboratory Manual CSH Press; and Robertson (ed. 1987) 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach IRL Press, 
Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J. 

15 Natl. Cancer Inst 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. 
Cancer 41:52-61) can be used as a host. Transplantable tumor cells (typically about 10 6 
cells) injected into isogenic hosts will produce invasive tumors in a high proportions of 
cases, while normal cells of similar origin will not. In hosts which developed invasive 

20 tumors, cells expressing a cancer-associated sequences are injected subcutaneously. After a 
suitable length of time, preferably about 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tumors that have 
statistically significant reduction (using, e.g., Student's T test) are said to have inhibited 
growth. 

25 

Polynucleotide modulators of cancer 
Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a cancer-associated protein is down- 
regulated, or entirely inhibited, by the use of an inhibitory or antisense polynucleotide, e.g., 
30 a nucleic acid complementary to, and which can preferably hybridize specifically to, a 
coding mRNA nucleic acid sequence, e.g., a cancer protein mRNA, or a subsequence 
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation 
and/or stability of the mRNA. 
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In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 
sugar linkages. Exemplary among these are the phosphorothioate and other sulfur 
5 containing species. Analogs are comprehended by this invention so long as they function 
effectively to hybridize with the cancer protein mRNA. See, e.g., Isis Pharmaceuticals, 
Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, 
or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 

10 including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oligonucleotides. 
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 
sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 

1 5 acid sequence (either KNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for cancer molecules. A preferred antisense molecule is for a cancer 
sequence in the Table 2 or the attached listing of SEQ ID NOs:l-l 16, or for a ligand or 
activator thereof. Antisense or sense oligonucleotides, according to the present invention, 
comprise a fragment generally at least about 14 nucleotides, preferably from about 14-30 

20 nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a 
cDNA sequence encoding a given protein is described in, e.g., Stein and Cohen (1988) 
Cancer Res. 48:2659-2668; and van der Krol, et al. (1988) BioTechniques 6:958-976. 

RNA interference is a mechanism to suppress gene expression in a sequence specific 
manner. See, e.g., Brumelkamp, et al. (2002) Sciencexpress (21March2002); Sharp (1999) 

25 Genes Dev. 13:139-141; and Cathew (2000 Curr. On. Cell Biol. 13:244-248. In 

mammalian cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have 
been shown to be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) 
Nature 41 1 :494-498. The mechanism may be used to downregulate expression levels of 
identified genes, e.g., treatment of or validation of relevance to disease. 

30 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of cancer-associated nucleotide sequences. A ribozyme is an RNA molecule 
that catalytically cleaves other RNA molecules. Different kinds of ribozymes have been 
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described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, RNase 
P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 25: 289- 
317 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. 
5 (1990) Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. 
Patent No. 5,254,678. Methods of preparation are described in, e.g., WO 94/26877; 
Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) 
Human Gene Therapy 1:39-45; Leavitt, et al.(1995) Proc. Natl. Acad. Sci. USA 92:699- 
703; Leavitt, et al. (1994) Human Gene Therapy 5:1 151-120; and Yamada, et al. (1994) 

10 Virology 205: 121-126. 

Polynucleotide modulators of cancer may be introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited 
to, cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 

15 surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of cancer 
may be introduced into a cell containing the target nucleic acid sequence, e.g., by formation 

20 of an polynucleotide-lipid complex, as described in WO 90/10448. It is understood that the 
use of antisense molecules or knock out and knock in models may also be used in screening 
assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating cancer in cells or organisms are 
provided. In one embodiment, the methods comprise administering to a cell an anti-cancer 

25 antibody that reduces or eliminates the biological activity of an endogenous cancer protein. 
Alternatively, the methods comprise administering to a cell or organism a recombinant 
nucleic acid encoding a cancer protein. This may be accomplished in any number of ways. 
In a preferred embodiment, e.g., when the cancer sequence is down-regulated in cancer, 
such state may be reversed by increasing the amount of cancer gene product in the cell. 

30 This can be accomplished, e.g., by overexpressing the endogenous cancer gene or 

administering a gene encoding the cancer sequence, using known gene-therapy techniques. 
In a preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), e.g., as described in 
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PCT/US93/0386. Alternatively, e.g., when the cancer sequence is up-regulated in cancer, 
the activity of the endogenous cancer gene is decreased, e.g., by the administration of a 
cancer antisense or other inhibitor, e.g., RNAi. 

In one embodiment, the cancer proteins of the present invention may be used to 

5 generate polyclonal and monoclonal antibodies to cancer proteins. Similarly, the cancer 
proteins can be coupled, using standard technology, to affinity chromatography columns. 
These columns may then be used to purify cancer antibodies useful for production, 
diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies are 
generated to epitopes unique to a cancer protein; that is, the antibodies show little or no 

10 cross-reactivity to other proteins. The cancer antibodies may be coupled to standard affinity 
chromatography columns and used to purify cancer proteins. The antibodies may also be 
used as blocking polypeptides, as outlined above, since they will specifically bind to the 
cancer protein. 

Methods of identifying variant cancer-associated sequences 

1 5 Without being bound by theory, expression of various cancer sequences is correlated 

with cancer. Accordingly, disorders based on mutant or variant cancer genes may be 
determined. In one embodiment, the invention provides methods for identifying cells 
containing variant cancer genes, e.g., determining all or part of the sequence of at least one 
endogenous cancer gene in a cell. In a preferred embodiment, the invention provides 

20 methods of identifying the cancer genotype of an individual, e.g., determining all or part of 
the sequence of at least one cancer gene of the individual. This is generally done in at least 
one tissue of the individual, and may include the evaluation of a number of tissues or 
different samples of the same tissue. The method may include comparing the sequence of 
the sequenced cancer gene to a known cancer gene, e.g., a wild-type gene. 

25 The sequence of all or part of the cancer gene can then be compared to the sequence 

of a known cancer gene to determine if any differences exist. This can be done using 
known homology programs, such as Bestfit, etc. In a preferred embodiment, the presence 
of a difference in the sequence between the cancer gene of the patient and the known cancer 
gene correlates with a disease state or a propensity for a disease state, as outlined herein. 

30 In a preferred embodiment, the cancer genes are used as probes to determine the 

number of copies of the cancer gene in the genome. 

In another preferred embodiment, the cancer genes are used as probes to determine 
the chromosomal localization of the cancer genes. Information such as chromosomal 
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localization finds use in providing a diagnosis or prognosis in particular when chromosomal 
abnormalities such as translocations, and the like are identified in the cancer gene locus. 
Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a cancer protein or modulator 
5 thereof, is administered to a patient. By "therapeutically effective dose" herein is meant a 
dose that produces effects for which it is administered. The exact dose will depend on the 
purpose of the treatment, and will be ascertainable using known techniques. See, e.g., 
Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug Delivery Lippincott; 
Lieberman (1992) Pharmaceutical Dosage Forms (vols. 1-3) Dekker, ISBN 0824770846, 

10 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art. Science and Technology of 
Pharmaceutical Compounding Amer. Pharmaceut Assn.; and Pickar (1998) Dosage 
Calculations Thomson. Adjustments for cancer degradation, systemic versus localized 
delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction, and the severity of the condition may be 

15 necessary. U.S. Patent Application No. 09/687,576, further discloses the use of 
compositions and methods of diagnosis and treatment in cancer. 

A "patient" for the purposes of the present invention includes both humans and other 
animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 

20 primate, and in the most preferred embodiment the patient is human. 

The administration of the cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, 
subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, 
intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In some instances, 

25 e.g., in the treatment of wounds and inflammation, the cancer proteins and modulators may 
be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a cancer protein 
in a form suitable for administration to a patient In the preferred embodiment, the 
pharmaceutical compositions are in a water soluble form, such as being present as 

30 pharmaceutical^ acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
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sulfuric acid, nitric acid, phosphoric acid, and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid, and the 

5 like. "Pharmaceutically acceptable base addition salts" include those derived from 

inorganic bases such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, 
zinc, copper, manganese, aluminum salts, and the like. Particularly preferred are the 
ammonium, potassium, sodium, calcium, and magnesium salts. Salts derived from 
phannaceutically acceptable organic non-toxic bases include salts of primary, secondary, 

10 and tertiary amines, substituted amines including naturally occurring substituted amines, 
cyclic amines and basic ion exchange resins, such as isopropylamine, trimethylamine, 
diethylamine, triethylamine, tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 

1 5 lactose, com and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, 

20 capsules and lozenges. It is recognized that cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing 
the molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, 
or by packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 

25 protection barrier. Means of protecting agents from digestion are available. 

The compositions for administration will commonly comprise a cancer protein 
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. 
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions 
are sterile and generally free of undesirable matter. These compositions may be sterilized 

30 by conventional, well known sterilization techniques. The compositions may contain 

pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents, and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate, 
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and the like. The concentration of active agent in these formulations can vary widely, and 
will be selected primarily based on fluid volumes, viscosities, body weight, and the like in 
accordance with the particular mode of administration selected and the patienfs needs (e.g., 
(1980) Remington's Pharmaceutical Science (18th ed.) Mack, and Hardman and Limbird 
5 (eds. 2001) Goodman and Gilman: The Pharmacological Basis of Therapeutics (10th ed.) 
McGraw-Hill. 

Thus, a typical pharmaceutical composition for intravenous administration would be 
about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient 
per day may be used, particularly when the drug is administered to a secluded site and not 

1 0 into the blood stream, such as into a body cavity or into a lumen of an organ. Substantially 
higher dosages are possible in topical administration. Actual methods for preparing 
parenterally administrable compositions will be known or apparent. 

The compositions containing modulators of cancer proteins can be administered for 
therapeutic or prophylactic treatments. In therapeutic applications, compositions are 

15 administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to 
cure or at least partially arrest the disease and its complications. An amount adequate to 
accomplish this is defined as a "therapeutically effective dose." Amounts effective for this 
use will depend upon the severity of the disease and the general state of the patients health. 
Single or multiple administrations of the compositions may be administered depending on 

20 the dosage and frequency as required and tolerated by the patient. In any event, the 
composition should provide a sufficient quantity of the agents of this invention to 
effectively treat the patient. An amount of modulator that is capable of preventing or 
slowing the development of cancer in a mammal is referred to as a "prophylactically 
effective dose." The particular dose required for a prophylactic treatment will depend upon 

25 the medical condition and history of the mammal, the particular cancer being prevented, as 
well as other factors such as age, weight, gender, administration route, efficiency, etc. Such 
prophylactic treatments may be used, e.g., in a mammal who has previously had cancer to 
prevent a recurrence of the cancer, or in a mammal who is suspected of having a significant 
likelihood of developing cancer based, at least in part, upon gene expression profiles. 

30 Vaccine strategies may be used, in either a DNA vaccine form, or protein vaccine. 

It will be appreciated that the present cancer protein-modulating compounds can be 
administered alone or in combination with additional cancer modulating compounds or with 
other therapeutic agent, e.g., other anti-cancer agents or treatments. 
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In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Table 2 or the attached listing of SEQ ID 
NOs:l-58, such as RNAi, antisense polynucleotides or ribozymes, will be introduced into 
cells, in vitro or in vivo. The present invention provides methods, reagents, vectors, and 
5 cells useful for expression of cancer-associated polypeptides and nucleic acids using in vitro 
(cell-free), ex vivo or in vivo (cell or organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 

10 of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors, and other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA, or other foreign genetic material into a host cell 
(see, e.g., Berger and Kimmel (1987) Guide to Molecular Cloning Techniques from 
Methods in Enzvmology (vol. 152) Academic Press; Ausubel, et al. (eds. 1999 and 

15 supplements) Cunent Protocols Lippincott; and Sambrook, et al. (2001) Molecular Cloning: 
A Laboratory Manual (3d ed., Vol. 1-3) CSH Press. 

In a preferred embodiment, cancer proteins and modulators are administered as 
therapeutic agents, and can be formulated as outlined above. Similarly, cancer genes 
(including both the full-length sequence, partial sequences, or regulatory sequences of the 

20 cancer coding regions) can be administered in a gene therapy application. These cancer 
genes can include inhibitory applications, e.g., as inhibitory RNA, gene therapy (e.g., for 
incorporation into the genome), or antisense compositions. 

Cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions 

25 can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al. (1995) J. Clin. Invest. 95:341- 
349), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres (see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294,; Alonso, et al. 
(1994) Vaccine 12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions 
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) 

30 Nature 344:873-875; Hu, et al. (1998) Clin Exp Immunol. 1 13:235-243), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn (1988) Proc. Natl. Acad. Sci. USA 85:5409-5413; 
Tarn (1996) J. Immunol. Methods 196:17-32), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
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vectors (Perkus, et al., p. 379, in Kaufmann (ed. 1996) Concepts in Vaccine Development 
de Gruyter; Chakrabarti, et al. (1986) Nature 320:535-537; Hu, et al. (1986) Nature 
320:537-540; Kieny, et al. (1986) Bio/Technology 4:790-795; Top, et al. (1971) J. Infect. 
Pis. 124:148-154; Chanda, et al. (1990) Virology 175:535-547), particles of viral or 

5 synthetic origin (see, e.g., Kofler, et al. (1996) J. Immunol. Methods 192:25-35; Eldridge, et 
al. (1993) Sem. Hematol. 30:16-24; Falo, et al. (1995) Nature Med. 1:649-653), adjuvants 
(Warren, et al. (1986) Annu. Rev. Immunol. 4:369-388; Gupta, et al. (1993) Vaccine 
11:293-306), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585-1589; Rock (1996) 
Immunol. Today 17:131-137), or, naked or particle absorbed cDNA (Ulmer, et al. (1993) 

10 Science 259:1745-1749; Robinson, et al. (1993) Vaccine 1 1:957-960; Shiver, et al., p 423, 
in Kaufmann (ed. 1996) Concepts in Vaccine Development de Gruyter; Cease and 
Berzofsky (1994) Annu. Rev. Immunol. 12:923-989; and Eldridge, et al. (1993) Sem. 
Hematol. 30: 16-24). Toxin-targeted delivery technologies, also known as receptor mediated 
targeting, such as those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may 

15 also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis, or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 

20 as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron, or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 

25 polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 

Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also 
be used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
encoding one or more of the polypeptides, or a fragment thereof, is administered to a 

30 patient. This approach is described, for instance, in Wolff et. al. (1990) Science 247:1465- 
1468, as well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 
5,679,647; WO 98/04720; and in more detail below. Examples of DNA-based delivery 
technologies include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) 
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delivery, cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated 
delivery (see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover, et al. (1991) Nature 351 :456-460. A wide variety of other vectors are 
availablel for therapeutic administration or immunization, e.g., adeno and adeno-associated 
vims vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, 
and the like. See, e.g., Shata, et al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) 
15 J. Leukoc. Biol. 68:793-806; Hipp, et al. (2000) In Vivo 14:571-85. 

Methods for the use of genes as DNA vaccines are well known, and include placing 
a cancer gene or portion of a cancer gene under the control of a regulatable promoter or a 
tissue-specific promoter for expression in a cancer patient. The cancer gene used for DNA 
vaccines can encode full-length cancer proteins, but more preferably encodes portions of the 
20 cancer proteins including peptides derived from the cancer protein. In one embodiment, a 
patient is immunized with a DNA vaccine comprising a plurality of nucleotide sequences 
derived from a cancer gene. For example, cancer-associated genes or sequence encoding 
subfragments of a cancer protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
25 responses. This procedure provides for production of cytotoxic T cell responses against 
cells which present antigen, including intracellular epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the cancer polypeptide encoded by the DNA vaccine. 
30 Additional or alternative adjuvants are available. 

In another preferred embodiment, cancer genes find use in generating animal models 
of cancer. When the cancer gene identified is repressed or diminished in cancer tissue, gene 
therapy technology, e.g., wherein inhibitory or antisense RNA directed to the cancer gene 
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will also diminish or repress expression of the gene. Animal models of cancer find use in 
screening for modulators of a cancer-associated sequence or modulators of cancer. 
Similarly, transgenic animal technology, including gene knockout technology, e.g., as a 
result of homologous recombination with an appropriate gene targeting vector, will result in 
the absence or increased expression of the cancer protein. When desired, tissue-specific 
expression or knockout of the cancer protein may be necessary. 

It is also possible that the cancer protein is overexpressed in cancer. As such, 
transgenic animals can be generated that overexpress the cancer protein. Depending on the 
desired expression level, promoters of various strengths can be employed to express the 
transgene. Also, the number of copies of the integrated transgene can be determined and 
compared for a determination of the expression level of the transgene. Animals generated 
by such methods will find use as animal models of cancer and are additionally useful in 
screening for modulators to treat cancer. 
Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits 
are also provided by the invention. In diagnostic and research applications, such kits may 
include at least one of the following: assay reagents, buffers, cancer-specific nucleic acids 
or antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, 
dominant negative cancer polypeptides or polynucleotides, small molecule inhibitors of 
cancer-associated sequences etc. A therapeutic product may include sterile saline or another 
pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials, they are not limited to such. A 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to, electronic 
storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), 
and the like. Such media may include addresses to internet sites that provide such 
instructional materials. 

The present invention also provides for kits for screening for modulators of cancer- 
associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a 
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing 
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cancer-associated activity. Optionally, the kit contains biologically active cancer protein. A 
wide variety of kits and components can be prepared according to the present invention, 
depending upon the intended user of the kit and the particular needs of the user. Diagnosis 
would typically involve evaluation of a plurality of genes or products. The genes will 
typically be selected based on correlations with important parameters in disease which may 
be identified in historical or outcome data. 



EXAMPLES 

Example 1: Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and 
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as 
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 
14:981-993). 
Table 1 

Table 1 lists medical conditions, pathologies, abnormalities, or organs affected by 
disease, referred to in Table 2, for which markers have been identified, and other related 
medical conditions (including various stages and/or metastases) in which those markers will 
also be useful, e.g., in therapeutic, diagnostic, prognostic, subsetting, vaccine, and other 
uses. 



Table 1 



blood 

vessels/angiogen 
esis: 

bladder: 


hemangiomas, lymphangiomas, angiosarcoma, lymphangiosarcoma, Kaposi's sarcoma, wound healing, tissue ^ 
remodeling, psoriasis, ischemic, heart disease, inflammatory diseases (e.g., arthritis, asthma, chronic bronchitis), 
atherosclerosis ^endometriosis, presumed ocular histoplasmosis syndrome, hypoxia, solid tumors, lymphomas 
lymphadenitis, lymphangitis, autoimmune diseases (e.g., RA, SLE, juvenile chronic arthritis, pigmented 
villonodular synovitis, etc.), retinal neovascularization syndromes (e.g., diabetic retinopathy, macular degeneration 
uteris fibro^ ^ Syndr ° me ' scIeritis/con Junctivitis, hypertrophic scars (keloid), birth control, ' 
1 carcinoma in situ, papillary carcinomas, transitional cell carcinoma, squamous cell carcinoma 


bone: 


Ewmg sarcoma, sarcomas arising from skeletal and extraskeletal connective tissues, including the peripheral 
nervous system (e.g. chondrosarcoma, osteosarcoma) 


brain: 
breast: 


glioblastoma, oligodendroghoma, anablastic astrocytoma, meningioma, medulablastoma, neuroblastoma 
ependymoma, schwannoma, craniopharyngioma, pineoblastoma, pineocytoma, neurofibroma, neurofibrosarcoma 
malignant peripheral nerve sheath tumors, granular cell tumors, plexosarcoma, ganglioneuroblastoma, 
neuroepithelioma, neuroma, ganglioneuroma " 

ductal carcinoma in situ, lobular carcinoma in situ " " — ~ 


cervix: 
colon/rectum: 


cancer of the cervix, vagina, or vulva — — 

precancerous colorectal disease (e.g., neoplastic polyps (adenomas), familial adenomatous polyposis, ulcerative 
colitis), colon cancer, e.g., epithelial tumor (e.g., adenocarcinoma, mucinous adenocardnoirSetring cell 
adenocarcinoma, squamous cell carcinoma, adenosquamous carcinoma, undifferentiated carcinoma, unclassified 
carcinoma) earmold tumor (e.g., argentaffin, nonargentaffin, composite), non-epithelial tumor (e.g., leimyo 
sarcoma, others), inflammatory bowel disease (e.g., ulcerative colitis, Crohn's disease (granulomatous colitis) 
dysplasia), rectal cancer, cancer of the anal region (e.g, squamous cell carcinoma, transitional carcinoma 
adenocarcinoma, carcinoma, papillary villous carcinoma, mucinous adenocarcinoma, melanoma) 


esophagus: 


premangnant or Fedisposmg conditions (e.g., esophagitis), squamous cell cancers (e.g., cancers of the head and 
neck lung, or cervix), gas trod i Restive carcinomas (e.g., cancers of the stomach, colon, or recrumi 


fibrosis: " 
head and neck: 


P i lmo . nar >; hypersensitivity pneumonitis,interstitial pneumonitis, nonspecific 
^iS^! * X / c ; hrom f u obs ^ uct,vc Pulmonary disease (e.g., emphysema, chronic bronchitis), aVthma, 
bronchiectasis, cirrhosis fliver fibrosis), renal fibrosis, scleroderma, wound healing 

tumors of the nasal cavity, paranasal sinuses, nasopharynx, oral cavity, oral pharynx, lip, larynx, hypopharynx. """ 
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salivary glands, paragangliomas, esophagus 


kidney: 


clear cell (nonpapillary) carcinoma, papillary carcinoma, chromophobe renal carcinoma, hypernephroma, 
adenocarcinoma, sporadic renal carcinomas, hereditary renal carcinomas (von Hippel-Lindau disease), carcinoma 
of the renal pelvis, ureteral carcinoma, fibroma, papillary adenoma, angiomyolipoma, oncocytoma 


leukocytes: 


acute lymphoblastic leukemia/lymphoma, chronic lymphocytic leukemia, follicular lymphoma, large B-cell 
lymphoma, Burkitt lymphoma, plasma cell neoplasms, mantle cell lymphoma, rymphoplasmacytic lymphoma, 
peripheral T-cell lymphoma, adult T-cell leukemia/lymphoma, Hodgkin disease, acute myelogenous leukemia, 
chronic myelogenous leukemia, thymic hyperplasia, hairy cell leukemia, malignant transformation, inappropriate 
activation or abnormalities of leukocytes (e.g., immature, precursor B (pre-B) or precursor T (pre-T) lymphocytes, 
monocytes, neutrophils, eosinophils, basophils, dendritic cells, lymphoblasts), arthritis, inflammation, leukocytosis, 
lymphadenitis, lymphangitis, bacteremia, chronic nonspecific lymphadenitis, psoriasis, wound healing 


liver. 


hepatitis (e.g., types A, B, C), benign epithelial rumors and tumor bile conditions, primary malignant epithelial 
tumors, primary malignant mesenchymal tumors, tumors of the gallbladder or bile duct 


lung: 


lung cancer, small cell lung carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell 
carcinoma, adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic pulmonary 
fibrosis, hypersensitivity pneumonitis,interstitial pneumonitis, nonspecific idiopathic pneumonitis), chronic 
obstructive pulmonary disease (e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, esophageal cancer 


ovary: 


ovarian carcinoma (e.g., epithelial (serous tumors, mucinous tumors, endometrioid tumors), germ cell (e.g., 
teratomas, choriocarcinomas, polyembryomas, embryomal carcinoma, endodermal sinus tumor, dysgerminoma, 
gonadoblastoma), stromal carcinomas (e.g., granulosal stromal cell tumors)), fallopian tube carcinoma, peritoneal 
carcinoma, leiomyoma 


pancreas: 


adenocarcinoma, ductal adenocarcinoma, mucinous cyst adenocarcinoma, acinar cell carcinoma, unclassified large 
cell carcinoma, small cell carcinoma, pancreatoblastoma, duct-ectatic mucin-hypersecreting tumor, mucinous cyst 
adenoma, papillary cystic neoplasm, serous cyst adenoma, diabetes melitis, chronic pancreatitis 


prostate: 


epithelial neoplasms (e.g., adenocarcinoma, small cell tumors, transitional cell carcinoma, carcinoma in situ, and 
basal cell carcinoma), carcinosarcoma, non-epithelial neoplasms (e.g., mesenchymal and lymphoma), germ cell 
tumors, prostatic intraepithelial neoplasia (PIN), hormone independent prostate cancer, benign prostate hyperplasia, 
prostatitis 


skin/melanoma: 


melanoma, lentigo (common benign localized hyperplasia of melanocytes), nevocellular nevi (congenital or 
acquired neoplasm of melanocytes), actinic keratosis (overgrowth of outer layers of skin), basal cell carcinoma, 
Merkel cell carcinoma, benign fibrous histiocytoma (dermal neoplasms of fibroblasts and histiocytes), 
dermatofibrosarcoma protuberans (well differentiated fibrosarcoma of the skin), xanthomas (rumor-like collections 
of foamy histiocytes within the dermis), dermal vascular tumors, seborrheic keratoses (benign tumor), acanthosis 
nigricans (benign or malignant hyperplasia and hyperpigmentation of skin), and squamous cell carcinomas of the 
skin, lung, cervix, esophagus, uterus, head, neck, or bladder 


soft tissue: 


soft tissue tumors (e.g., fibrosarcoma, liposarcoma, leiomyosarcoma, histiocytoma, fibrohistiocytic sarcoma) 
smooth muscle tumors (e.g., rhabdomyoma, rhabdomyosarcoma) tumors of the blood and lymph vessels (e.g., 
angiosarcoma, lymphangtosarcoma, Kaposi's sarcoma), perivascular tumors (e.g., glomus tumors, 
hemangiopericytoma), synovial tumors (e.g., mesothelioma), neural tumors (e.g., neurofibroma, 
neurofibrosarcoma, malignant peripheral nerve sheath tumors, granular cell tumors, plexosarcoraa, 
ganglioneuroblastoma, neuroepithelioma, extraskeletal Ewing's sarcoma, schwannoma, neuroma, ganglioneuroma), 
paraganglioma, extraskeletal cartilaginous and osseous tumors (e.g., chondrosarcoma, osteosarcoma), pluripotential 
mesenchymal tumors, epitheliod sarcomas, rhabdoid tumors, desmoplastic small cell tumors, alveolar sarcoma 


stomach: 


adenocarcinoma, squamous cell carcinoma, adenoacanthoma, carcinoid, leiomyosarcoma, gastritis (chronic 

n*«/«nkin U ni/lnri nccn/>mttv1\ hvn*K>mtac W/* nftlvTV! Jinnnui lp!nm vnm!i psntih n ppfil nnpnnC3rcinomfl5 
arropmc, n. pyiori h&sociomujj, nypcipiaiiiic puij|j%, uijvjuia, i^iviLuyuuia, Mv»]/iiogwii buwiwkbivuiwiiii« 


testicles: 


germ cell tumors (including seminomas, embryonal carcinomas, teratomas, choriocarcinomas, yolk sac tumors), 
sex chord stromal tumors (including Leydig cell tumors, Sertoli cell tumors, and Granulosa cell tumors), germ cell 
and gonadal stromal elements (e.g., gonadoblastomas), adnexal and paratesticular tumors (e.g., mesotheliomas, soft 
tissue sarcomas, and adnexal of the rete testes), miscellaneous neoplasms (including carcinoid, lymphoma, and 
cysts) 


uterus: 


epithelial tumors (e.g., endometrioid, papillary endometrioid, papillary serous, clear cell, mucinous), mesenchymal 
tumors (e.g., endometrial stromal sarcoma, leiomyosarcoma, nonspecific sarcomas), mixed tumors (e.g., malignant 
mixed mullerian tumors, adenosarcoma) 



Table 2: Disease Indications of Selected Genes 

Table 2 provides disease indications for about 59 selected genes. These genes may 
be useful as targets for small molecule, antibody, or DNA vaccine therapy. They may also 
5 have utility as prognostic or diagnostic markers. These genes were identified using 
Eos/Affymetrix Genechip arrays. The columns in Table 2 are as follows: 
Pkey: Unique Eos probeset identifier number 
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Ex Accn: Exemplar Accession number 
UnigenelD: UniGene ID number 
UnigeneTitle: UniGene title 

Disease Indications: Diseases indicated for selected gene as described in Table 1 
5 and abbreviated as follows: 

AWPC (androgen independent prostate diseases), arth (arthritic diseases), bph 
(benign prostatic hyperplasia), blad (bladder diseases), angio (blood vessel diseases), EWS 
(bone diseases), glio (brain diseases), breast (breast diseases), cerv (cervical diseases), colon 
(colorectal diseases), esoph (esophageal diseases), fibro (fibrotic diseases), headnk (head & 

10 neck diseases), leio (leiomyoma diseases), leuk (leukocyte diseases), hepC (liver diseases), 
lung (lung diseases), ovar (ovarian diseases), endo (ovarian endometrioid diseases), omuc 
(ovarian mucinous diseases), pane (pancreatic diseases), pros (prostate diseases), renal 
(renal diseases), mela (skin diseases), stom (stomach diseases), test (testicular diseases), uter 
(uterine diseases) 

15 AA: Refseq amino acid accession number 

NA: Refseq nucleotide accession number 

SEQ ID NOs: Sequence identification numbers linking Pkey to corresponding SEQ 
IDNOs:l-116. 



20 Table 2: Disease Indications of Selected Genes 



Pkey 


Ex Accn 


UnigenelD 


Unigene Title 


Disease Indications 


NA 


AA 


SEQ ID NOs. 


453983 


H94997 


Hs.3 18751 


ESTs 


angio 


FGENESH 


FGENESH 


Seq ID No. 1 & 
59 


453983 


H94997 


Hs.3 18751 


ESTs 


angio 


NMJJ20249.1 


NP 064634.1 


Seq ED No. 2 & 
60 


428758 


AA433988 


Hs.98502 


CA125 antigen; 
mucin 16 


ovar, cerv, lung, 
pane, stom, renal 


NM 002253.1 


NPJW2244.1 


Seq ID No. 3 & 
61 


450983 


AA305384 


Hs.25740 


EROl (S. 
cerevisiae)-Mke 


blad, lung, ovar, 
pane 


NMJH4584.1 


NP 055399.1 


Seq ID No. 4 & 
62 


417771 


AA804698 


Hs.82547 


retmoic acid 
receptor responder 
(tazaro 


blad, cerv, pane, 
pros, ovar 


NM 002888.1 


NPJX)2879.1 


Seq ID No. 5 & 
63 


448262 


AW880830 


Hs. 186273 


Homo sapiens 
quiescin Q6 
(QSCN6) 


blad 


NM_002826.2 


NP 0028I7.2 


Seq ID No. 6 & 
64 


407720 


AB037776 


Hs.38002 


immunoglobulin 
superfamily, 
member 9 


lung 


NM_020789.l 


NPJ65840.1 


Seq ID No. 7 & 
65 


435013 


H91923 


Hs. 1 10024 


NM 020142:Homo 
sapiens 

NADHiubiquinone 

0 


renal, lung, sarc 


NMJJ20142.2 


NPJ64527.1 


Seq ID No. 8 & 
66 


330844 


AA063037 


Hs.66803 


ESTs 


lung 


NM_016247.1 


NPJ)5733U 


Seq ID No. 9 & 
67 


440659 


AF134160 


Hs.7327 


claudin 1 


lung 


NM_02U01 


NP_066924.I 


Seq ID No. 10 & 
68 
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449101 i 


\A205847 


tts.23016 


3 protein-coupled 
•eceptor 


ung, headnk 


XMJ)5 1522.4 : 


XP_05 1522.2 . 


Seq ©No. 11 & 
59 


429263 i 


\A019004 


Hs.198396 


MT-binding 
cassette, sub-family 
MABC1 


lung 


NM_000350.1 


NP_000341.1 


Seq ID No. 12 & 
70 


421474 


U76362 


Hs. 104637 


solute carrier family 
1 (glutamate trans 


lung 


NM.006671.2 


NPJJ06662.2 


Seq ID No. 13 & 
71 


421753 


BE3 14828 


Hs. 107911 


ATP-binding 
cassette, sub-family 
B (MDR/ 


lung 


NM_005689 


NP_005680.1 


Seq ID No. 14 & 
72 


408482 


NM_OO0676 


Hs.45743 


adenosine A2b 
receptor 


lung, esoph, headnk, 
colon 


NM_000676 


NPJJ00667.1 


Seq ID No. 15 & 
73 


426761 


AI0 15709 


Hs.172089 


PORIMDM Pro- 
oncosis receptor 
inducing me 


lung, esoph, pros, 
uter, pane, colon, 
ovar. headnk 


IN JV1__V/ J £tj 3** 


NP 443164 


Seq ID No. 16 & 
74 


429736 


AF125304 


Hs.212680 


tumor necrosis 
factor receptor 
superfami 


lung 


in rvi^uut 1 7j 


NP flfW.186 1 


Seq ID No. 17 & 
75 


430985 


AA490232 


Hs.27323 


ESTs, Weakly 
similar to 178885 
serine/th 


lung 




RAC03767 I 


Seq ID No. 1 8 & 
76 


431890 


X17033 


Hs.271986 


integrin, alpha 2 
(CD49B, alpha 2 
subuni 


blad, headnk, lung, 
pane, cerv, stom 


IN IV1 Uw LLM j<li 


NP 002194 1 


Seq ID No. 19 & 
77 


432583 


AW023624 


Hs. 1 62282 


potassium channel 
TASK-4; potassium 
chan 


lung 


1N1V1_UJ l*tV>\J 


NP 113648 1 


Seq ID No. 20 & 
78 


446872 


X97058 


Hs.16362 


pyrimidinergic 
receptor P2Y, G- 
protein c 


lung 


NM_004154 


NP_004145.1 


Seq ID No. 21 & 
79 


453102 


NM_007197 


Hs.31664 


frizzled 
(Drosophila) 
homolog 10 


lung, headnk, colon 


NM_007197 


NPJ)09128.1 


Seq ID No. 22 & 
80 


404287 


NMJ73674.1 


Hs.449321 


Homo sapiens 
discoidin, CUB and 
LCCL domain 
containing 1 
(DCBLD1) 


pane, lung, colon, 
uter, esoph 


NM_173674.1 


NPJ775945.1 


Seq ID No. 23 & 
81 


404287 


NMJ73674.1 


Hs.449321 


Homo sapiens 
discoidin, CUB and 
LCCL domain 
containing 1 
(DCBLD1) 


pane, lung, colon, 
uter, esoph 


NMJ73674.1 


NP_775945.1 


Seq ID No. 24 & 
82 


418318 


U47732 


Hs.84072 


transmembrane 4 
superfamily 
member 3 


pane, pros, colon, 
stom, omuc 


NM_004616.2 


NPJ04607.1 


Seq ID No. 25 & 
83 


444754 


T83911 


Hs.11881 


transmembrane 4 
superfamily 
member 4 


pane, omuc, stom, 
lung, colon 


NMJ>04617.2 


NP_004608.1 


Seq ID No. 26 & 
84 


428505 


AL035461 


Hs.2281 


chromogranm B 
(secretogranin 1) 


pane, lung 


NM_001819 


NP_001810.1 


SeqIDNo.27& 
85 


448844 


AI581519 


Hs.177164 


FGENESH 
predicted novel cell 
surface pr 


pane, lung, stom, 
omuc 




YP 091082 1 


Seq ID No. 28 & 
86 


448844 


AI581519 


Hs.177164 


\ FGENESH 

predicted novel cell 
surface pr 


pane, lung, stom, 
omuc 


ruti£Ncon 


~ FHFNF^H 


Seq ID No. 29 & 
87 


426227 


U67058 


Hs.l5429S 


) Human proteinase 
activated receptor-2 
mR 


pane, lung, colon, 
, esoph, stom 


xim nn^?d? ? 


NP 005233.2 


" Seq ID No. 30 & 
88 


445417 


AK001058 


Hs, 12680 


a disintegrin-like 
and metalloproteas 
w 


pane, headnk, stom 
£ lung, esoph, sarc, 
colon 


i in ivi_ \jj\jy j j 


NP 112217.1 


Seq ID No. 31 & 
89 


hi j f \y 


RE439580 


Hs 75498 


small inducible 
cytokine subfamily 
A(Cy 


leuk, pane, lung, 
headnk, cerv, color 
uter, stom, esoph 


NM_004591 


NP_0D4582.1 


Seq ID No. 32 & 
90 


416498 


U33632 


Hs.79351 


potassium channel 
subfamily K, 
member 1 


pane, stom, breast, 
endo, colon 


NM_002245.2 


NP_002236.1 


Seq ID No. 33 & 
91 


413095 


AA494359 


Hs.30715 


potassium voltage- 
gated channel, Isk- 
rel 


pane, stom, renal, 
colon 


NMJ)05472.1 


NP_005463. 


SeqIDNo.34& 
92 
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426125 


X87241 


Hs. 166994 


FAT tumor 
suppressor 
(Drosophila) 
homolo 


colon, stom, pane, 
pros, renal, fibro, 
cerv 


NMJ)05245.1 


NPJ)05236.1 


SeqlDNo. 35 & 
93 


436729 


BE621807 


Hs.351316 


transmembrane 4 
superfamily 
member 1 


pane, colon, stom, 
ovar, lung, blad 


NMJM4220.1 


NPJJ55035.1 


SeqlDNo. 36 & 
94 


437145 


AF0O7216 


Hs.5462 


solute carrier family 
4, sodium bicarbon 


pane, pros, stom 


NM_003759.1 


NPJX>3750.\ 


Seq ID No. 37 & 
95 


451820 


AW058357 


Hs. 199248 


ESTs 


pane 


NMJ)00958 


NPJ)00949.1 


SeqlDNo. 38 & 
96 


427557 


NMJJ02659 


Hs. 179657 


plasminogen 
activator, urokinase 
recepto 


pane, colon, stom, 
ovar, cerv, blad, 
lung, headnk, esoph 


NMJW2659.1 


NPJJ02650.1 


SeqlDNo. 39 & 
97 


408308 


AL033377 


Hs.44197 


hypothetical protein 
DKFZp564D0462 


pane, renal, colon 


AK027843.1 


BAB55406.1 


Seq ID No. 40 & 
98 


428242 


H55709 


Hs.2250 


leukemia inhibitory 
factor (cholinergic 


ovar, pane, leuk, 
Jung 


NMJX)2309.2 


NPJJ02300.1 


Seq ID No. 41 & 
99 


428778 


AK000530 


Hs.193326 


fibroblast growth 
factor receptor-like 
1 


ovar 


NM.021923 


NPJJ68742 


Seq ID No. 42 & 
100 


439659 


AW970780 


Hs,59483 


leucine-rich repeat- 
containing G 
protein 


ovar, stom, me la, 
colon 


XMJ>97508 


XP_097508 


Seq ED No. 43 & 
101 


411825 


AK000334 


Hs.352415 


solute carrier family 
39 (zinc transport 


colon, ovar 


NMJ 30849 


NP_570901 


Seq ID No. 44 & 
102 


442133 


AW874138 


Hs.129017 


ESTs; type la 

transmembrane 

protein 


ovar, uter 


XM_087172 


XPJJ87172 


Seq ID No. 45 & 
103 


412314 


AA825247 


Hs.356084 


G protein-coupled 
receptor 27 
(GPR27) (S 


ovar, uter, test 


NMJ)18971 


NP_061844 


Seq ID No. 46 & 
104 


411828 


AW161449 


Hs.72290 


wingless-type 
MMTV integration 
site fami 


ovar 


NM_004625 


NP_004616 


SeqlDNo. 47& 
105 


439668 


AI091277 


Hs.302634 


frizzled 
(Drosophila) 
homolog 8 


ovar, uter 


NM_031866 


NP_1 14072 


SeqlDNo. 48 & 
106 


433336 


AFO 17986 


Hs.31386 


secreted frizzled- 
related protein 2 (str 


ovar, fibro, headnk, 
lung, pane, blad 


XM_050625 


XPJ)50625 


SeqlDNo. 49 & 
107 


432128 


AA127221 


Hs.66 


Interleukin 1 
receptor-like 1 


angio 


BC030975.1 


AAH30975.1 


SeqlDNo. 50 & 
108 


446921 


AB012U3 


Hs.16530 


small inducible 
cytokine subfamily 
A(Cy 


breast, pane, 
headnk, lung, fibro, 
mela 


NMJJ02988.I 


NPJW2979.1 


Seq ID No. 51 & 
109 


450623 


H02562 


Hs.28848 


Nedd4 binding 
protein 3 (N4BP3) 


angio 


XMJB8920.3 


XP_038920.2 


SeqlDNo. 52 & 
110 


450623 


H02562 


Hs.28848 


Nedd4 binding 
protein 3 (N4BP3) 


angio 


FGENESH 


FGENESH 


Seq ID No. 53 & 
111 


432179 


X75208 


Hs.2913 


EphB3 


ovar, colon, lung, 
pros 


NM_004443 


NP_004434.l 


Seq ID No. 54 & 
112 


431870 


AW449902 


Hs.105500 


Homo sapiens POU 
domain, class 5, 
transc 


renal 


FGENESH 


FGENESH 


SeqlDNo. 55 & 
113 


431870 


AW449902 


Hs.105500 


Homo sapiens POU 
domain, class 5, 
transc 


renal 


XMJ75178.1 


XPJ 75 178.1 


SeqlDNo. 56 & 
114 


437212 


A1765021 


Hs.210775 


ESTs 


renal, uter, ovar 


NMJ)0l074.l 


NP^OO 1065.1 


SeqlDNo. 57 & 
115 


442438 


AA995998 


Hs.371863 


gb:os26b03.sl 
NCI_CGAPJCid5 
Homo sapiens 


uter, ovat.renal 


FGENESH 


FGENESH 


SeqlDNo. 58 & 
116 



It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
5 sequences of accession numbers, and patent applications cited in this specification are 
herein incorporated by reference as if each individual publication, accession number, or 
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CLAIMS 

WHAT IS CLAIMED IS: 

1 . A method for detecting a pathological cell in a patient, said method 
comprising detecting in a biological sample from said patient a nucleic acid or polypeptide 

5 comprising a sequence at least 80% identical to a sequence selected from SEQ ID NOs: 1 - 
116. 

2. The method of Claim 1 , wherein said pathological cell has a pathology 
selected from those listed Table 1. 

3 . The method of Claim 1 , wherein said biological sample is tissue from an 
1 0 organ which is affected by a pathology listed in Table 1 . 

4. The method of Claim 1 , wherein said nucleic acids are mRNA. 

5. The method of Claim 1 5 further comprising a step of amplifying nucleic 

acids. 

6. The method of Claim 1, wherein said nucleic acid comprises a sequence 
15 selected from SEQ ID NOs:l-58. 

7. The method of Claim 1 , wherein said polypeptide comprises a sequence 
selected from SEQ ID NOs:59-l 16. 

8. The method of Claim 1 , wherein said detecting comprises using a biochip 
comprising a nucleic acid at least 80% identical to SEQ ID NOs: 1-5 8. 

20 9. The method of Claim 1 , wherein said patient is undergoing a therapeutic 

regimen to treat a pathology selected from those listed Table 1 . 

10. The method of Claim 1 , wherein said patient is suspected of having a 
pathology selected from those listed Table 1 . 

11. An isolated nucleic acid molecule comprising a sequence selected from SEQ 
25 ID NOs: 1-58. 
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12. The nucleic acid molecule of Claim 1 1 , wherein the nucleic acid is labeled. 

13. An expression vector comprising the nucleic acid of Claim 1 1 . 

14. A host cell comprising the expression vector of Claim 13. 

15. An isolated nucleic acid encoding a polypeptide sequence selected from SEQ 
5 IDNOs: 59-116. 

16. An isolated polypeptide encoded by a sequence selected from SEQ ID 
NOs:l-58. 

17. An antibody that specifically binds a polypeptide of Claim 16. 

18. The antibody of Claim 1 7, wherein the antibody is a humanized antibody. 
10 19. The antibody of Claim 1 7, wherein the antibody is an antibody fragment. 

20. The antibody of Claim 17, wherein the antibody is conjugated to an effector 
component. 

21 . The antibody of Claim 17, wherein the antibody is conjugated to a detectable 
label or a cytotoxic chemical. 

15 22. A method for specifically targeting a compound to a pathological cell in a 

patient, said method comprising administering to said patient an antibody of Claim 17, 
wherein said antibody is conjugated to the compound. 

23 . A method for detecting a pathological cell in a patient, said method 
comprising contacting a biological sample with an antibody of Claim 17. 

i 

20 24. The method of Claim 22, wherein said antibody is conjugated to an effector 

component or a fluorescent label. 

25 . The method of Claim 22, wherein said said biological sample is a blood, 
serum, urine, or stool sample. 



-91- 



WO 2004/073657 



PCT/US2004/005455 



26. A method for identifying a compound that modulates a pathology-associated 
polypeptide, said method comprising: 

a) contacting said compound with a pathology-associated polypeptide, said 
polypeptide encoded by a polynucleotide that selectively hybridizes to a 

5 sequence at least 80% identical to SEQ ID NOs:l-58; and 

b) determining the effect of said compound upon the function of said polypeptide. 

27. A screening assay comprising: 

a) administering a test compound to a cell from a mammal exhibiting a pathology 
selected from those listed in Table 1 ; 
1 0 b) administering a test compound to a cell from a mammal not exhibiting said 

pathology; 

c) comparing the expression level of a polynucleotide of the cell comprising a 
sequence at least 80% identical to SEQ ID NOs: 1-58 with the expression level of 
said polynucleotide of a control cell; 

1 5 whereby modulation of the expression level of the polynucleotide of the cell 

indicates that the test compound is a drug candidate. 
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SEQUENCE LISTING 

<110> Azi^, Natasha 
Gish, Kurt 
Wilson, Keith 
Zlotnik, Albert 

<120> METHODS OF DIAGNOSIS OF CANCER, COMPOSITIONS AND 

<130> 05882 . 0191 . OOPCO0 

<160> 116 

<170> Patentln version 3.2 

<210> 1 

<211> 7008 

<212> DNA 

<213> Homo Sapiens 

<400> 1 

atggctatga tgattcttcg ggttgactac acatttgagg aaaatagaga caagttagct 
tccaggaaga aggaatacag tcaaggaagt gtggcagacc tgactccaga caattggaaa 
aacatcaccg tgcctcacag tggaagacat tcagaggtgt ctaggggaga gctggtctgc 
agaacttgct cagaatgttc agctggtccc cacatctgga tgaaagggct ctatcagacc 
caagatgaag aagcaggagg agaaaatatt ttcattctgt tgttcattga gtcaacacaa 
tttggacagt ttgtggccat gggctctccg atcacagaac ataaagtctt taccatgtat 
cttggtttag ccacacatct attttacagc cttataactc acccttttgt tcttttggaa 
aaccactcct gcccaagctc agtccatggg tttgatgtag ctgggctgat ctttgacaaa 
gtgggcatga gatccagacc tggccggatg ggagcactgt ttgcatattt tgctggattt 
atcaggagaa aggcactggt tgtttgtttg tttgtttttt gctggagtaa tgaagctgct 
aacaagcccc ccattcaaga agccgctcag ctatcccggc cagcacaggg cgcccggcgc 
gcctcggagc gcaagttcct cgccttctcc tgcccgctcg ctgggcatta tgcggccaag 
cagccgagcc ccagtcctcc tcctcctcct gctcctccgg ctcctcctgc ggcccgagcg 
gctcagctct cggcaggcgg cggcgttgct cagccgagcg cagacgggac cctcgcagcg 
agacctcagc gactcctaaa gtcaaaagtt ggcggcgggc gccgggctcc gcgcgctctc 
cacggccgct gcctcgcgtc gccgccgcag ccaaggaggg caggagggag gggggtgggg 
gcagcggagg gaggggtggg aagcaccatg cagtttgtat cctgggccac actgctaacg 
ctcctggtgc gggacctggc cgagatgggg agcccagacg ccgcggcggc cgtgcgcaag 
gacaggctgc acccgaggca agtgaaatta ttagagaccc tgagcgaata cgaaatcgtg 
tctcccatcc gagtgaacgc tctcggagaa ccctttccca cgaacgtcca cttcaaaaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



WO 2004/073657 PCT/US2004/005455 

2/282 

acgcgacgga gcattaactc tgccactgac ccctggcctg ccttcgcctc ctcctcttcc 1260 

tcctctacct cctcccaggc gcattaccgc ctctctgcct tcggccagca gtttctattt 1320 

aatctcaccg ccaatgccgg atttatcgct ccactgttca ctgtcaccct cctcgggacg 1380 

cccggggtga atcagaccaa gttttattcc gaagaggaag cggaactcaa gcactgtttc 1440 

tacaaaggct atgtcaatac caactccgag cacacggccg tcatcagcct ctgctcagga 1500 

atggggcttc tggatgtttc agagctttct ggagtttgga ctcggttcag cggcgcgttg 1560 

cccaacgctg cgaggcggcc tggaagtcag tttccaaact cggagaaagt tactggagtt 1620 

gcggttccct gcagcaaact tggccacccg ggagcggagc cgctgagcgc agggcggacc 1680 

agactcctta ttgtggatct tacaaggcat ctgcccccca cttctccacg acatttgagg 1740 

agtcgctgcg gtacggttct ggcccgagca agggtggtcc ttgactttcc caagcgacgt 1800 

gcgtttctgc cacgcgcgtg tgacgcagaa actttcccgg cggggccttg gatactcacc 1860 

cccagacact gggctgctcc atcagtgcgc tgtcgttcgt gggtcttaaa gttcccaagc 1920 

acatccttcc ttctctgcct ttcaatggaa ggatcgggtg gagagcgtgg caagcctgaa 1980 

gactgggagg gggtagttct agcctgctgg gattcaagga aagggataaa cccctttagc 2040 

ccccagcaaa gcgcccggag ccgtggctcc cgaaatgcgc tgtcgagatt gtttgggggt 2100 

gggcggagac ggcagcttgg cgaagtggga gggggtgctg cactgggcac attccggtct 2160 

catgatgggg attattttat tgaaccacta cagtctatgg atgaacaaga agatgaagag 2220 

gaacaaaaca aaccccacat catttatagg cgcagcgccc cccagagaga gccctcaaca 2280 

ggaaggcatg catgtgacac ctcagggtta cagaaatgtc ttataaatgg aagccacgaa 2340 

aacatatatg tgtttgttga atgtttccta gaaacttcag gtttgctcat gttctgtgac 2400 

ttaaggaact gtagcaaggt acctgtacgt tatgctgtga gctacttctg caccccttct 2460 

ttgaattcag atgcagcttc tcagaacagt # ttagaatatg gcacgattca ccagcaggta 2520 

tcagaggaat ggaccaacag gtcaaggaca cctctggaac cagaacacaa aaataggcac 2580 

agtaaagaca agaagaaaac cagagcaaga aaatggggag aaaggattaa cctggctggt 2640 

gacgtagcag cattaaacag cggcttagca acagaggcat tttctgctta tggtaataag 2700 

acggacaaca caagagaaaa gaggacccac agaaggacaa aacgtttttt atcctatcca 2760 

cggtttgtag aagtcttggt ggtggcagac aacagaatgg tttcatacca tggagaaaac 2 820 

cttcaacact atattttaac tttaatgtca attgtagcct ctatctataa agacccaagt 2880 

attggaaatt taattaatat tgttattgtg aacttaattg tgattcataa tgaacaggat 2940 

gggccttcca tatcttttaa tgctcagaca acattaaaaa acttttgcca gtggcagcat 3000 
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L.*—y ciciy <a,ctL-d 


gcccaggcgg 


aatccatcat gatactgctg ttctcttaac aagacaggat 


3060 


aLuLyt ay cly 


cfccacgacaa 


atgtgatacc ttaggcctgg ctgaactggg aaccatttgt 


3120 




gaagcugncc 


tattagtgaa ga tagtggat tgagtacagc ttttacgatc 


3180 


f** /~» *3 f~ /-r ~* r^t /-i 

y Ltcd Ly ciy cj 


tgggccatgt 


gtttaacatg cctcatgatg acaacaacaa atgtaaagaa 


3240 


gaaygagcca 


agagtcccca 


gcatgtcatg gctccaacac tgaacttcta caccaacccc 


3300 


cggacgcggt 


caaagtgtag 


tcgaaaatat atcactgagt ttttagacac tggttatggc 


3360. 


gagtgtttgc 


ttaacgaacc 


tgaatccaga ccctaccctt tgcctgtcca actgccaggc 


3420 


atcctttaca 


acgtgaataa 


acaatgtgaa ttgatttttg gaccaggttc tcaggtgtgc 


3480 


ccatatatgc 


actgcaagta 


tggattttgt gttcccaaag aaatggatgt ccccgtgaca 


3540 


gatggatcct 


ggggaagttg 


gagtcccttt ggaacctgct ccagaacatg tggagggggc 


3600 


atcaaaacag 


ccattcgaga 


gtgcaacaga ccagaaccaa aaaatggtgg aaaatactgt 


3660 


gtaggacgta 


gaatgaaatt 


taagtcctgc aacacggagc catgtctcaa gcagaagcga 


3720 


gacuuccgag 


atgaacagtg 


tgctcacttt gacgggaagc attttaacat caacggtctg 


3780 


/*< 4~ +* **i /-i /•» --\ 4— /-» 

CLtcccaauy 


tgcgctgggt 


ccctaaatac agtggaattc tgatgaagga ccggtgcaag 


3840 


ccgccccgca 


gagtggcagg 


gaacacagcc tactatcagc ttcgagacag agtgatagat 


3900 


yyaaCuCCCt 


gtggccagga 


cacaaatgat atctgtgtcc agggcctttg ccggcaagct 


3960 


ggatgcgafcc 


atgfctttaaa 


ctcaaaagcc cggagagata aatgtggggt ttgtggtggc 


4020 


gataattctt 


catgcaaaac 


agtggcagga acatttaata cagtacatta tggttacaat 


4080 


actgtggtcc 


gaattccagc 


tggtgctacc aatattgatg tgcggcagca cagtttctca 


4140 


ggggaaacag 


acgatgacaa 


ctacttagct ttatcaagca gtaaaggtga attcttgcta 


4200 


aatggaaact 


ttgttgtcac 


aatggccaaa agggaaattc gcattgggaa tgctgtggta 


4260 


gagtacagtg 


ggtccgagac 


tgccgtagaa agaattaact caacagatcg cattgagcaa 


4320 


gaacttxtgc 


ttcaggtttt 


gtcggtggga aagttgtaca accccgatgt acgctattct 


4380 


Lccaatactc 


caattgaaga 


taaacctcag cagttttact ggaacagtca tgggccatgg 


4440 


caagcaugca 


gtaaacccfcg 


ccaaggggaa cggaaacgaa aacttgtttg caccagggaa 


4500 


fccfcgafccagc 


ucacugtctc 


tgatcaaaga tgcgatcggc tgccccagcc tggacacatt 


4560 


actgaaccct 


gtggtacaga 


ctgtgacctg aggtggcatg ttgccagcag gagtgaatgt 


4620 


agtgcccagt 


gtggcttggg 


ttaccgcaca ttggacatct actgtgccaa atatagcagg 


4680 


ctggatggga 


agactgagaa 


ggttgatgat ggtttttgca gcagccatcc caaaccaagc 


4740 


aaccgtgaaa 


aatgctcagg 


ggaatgtaac acgggtggct ggcgctattc tgcctggact 


4800 


gaatgttcaa 


aaagctgtga 


cggtgggacc cagaggagaa gggctatttg tgtcaatacc 


4860 
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cgaaatgatg tactggatga cagcaaatgc acacatcaag agaaagttac cattcagagg 4920 
tgcagtgagt tcccttgtcc acagtggaaa tctggagact ggtcagagtg cttggtcacc 4980 
tgtggaaaag ggcataagca ccgccaggtc tggtgtcagt ttggtgaaga tcgattaaat 5040 
gatagaatgt gtgaccctga gaccaagcca acatctatgc agacttgtca gcagccggaa sioo 
tgtgcatcct ggcaggcggg tccctgggga cagtgcagtg tcacttgtgg acagggatac 5160 
cagctaagag cagtgaaatg catcattggg acttatatgt cagtggtaga tgacaatgac 5220 
tgtaatgcag caactagacc aactgatacc caggactgtg aattaccatc atgtcatcct 5280 
cccccagctg ccccggaaac gaggagaagc acatacagtg caccaagaac ccagtggcga 5340 
tttgggtctt ggaccccatg ctcagccact tgtgggaaag gtacccggat gagatacgtc 54 00 
agctgccgag atgagaatgg ctctgtggct gaogagagtg cctgtgctac cctgcctaga 5460 

ccagtggcaa aggaagaatg ttctgtgaca ccctgtgggc aatggaaggc cttggactgg 5520 

agctcttgct ctgtgacctg tgggcaaggt agggcaaccc ggcaagtgat gtgtgtcaac 5 S8 0 

tacagtgacc acgtgatcga tcggagtgag tgtgaccagg attatatccc agaaactgac 564 0 

caggactgtt ccatgtcacc atgccctcaa aggaccccag acagtggctt agctcagcac 5700 

cccttccaaa atgaggacta tcgtccccgg agcgccagcc ccagccgcac ccatgtgctc 5760 

ggtggaaacc agtggagaac tggcccctgg ggagcaacat attggagaga gaataccatg 5820 

gagtttctag agctgtttct tccagaatct ttaactggac caggtagcaa atcctgtgac 5880 

cagcactatg gaagtacctg tgctggcgga tcccagcggc gtgttgttgt atgtcaggat 5940 

gaaaatggat acaccgcaaa cgactgtgtg gagagaataa aacctgatga gcaaagagcc 6000 

tgtgaatccg gcccttgtcc tcagtgggct tatggcaact ggggagagtg cactaagctg 6060 

tgtggtggag gcataagaac aagactggtg gtctgtcagc ggtccaacgg tgaacggttt 6120 

ccagatttga gctgtgaaat tcttgataaa cctcccgatc gtgagcagtg taacacacat 6180 

gcttgtccac acgacgctgc atggagtact ggcccttgga gctcgtccat gtggcaggtg 6240 

aataataaaa cagttacgct tgggaacttg tgttctgtct cttgtggtcg agggcataaa 6300 

caacgaaatg tttactgcat ggcaaaagat ggaagccatt tagaaagtga ttactgtaag 6360 

cacctggcta agccacatgg gcacagaaag tgccgaggag gaagatgccc caaatggaaa 642 0 

gctggcgctt ggagtcagaa aactactaac tcagactgca ctgaagctga ctgtggtcac 6480 

ctggcagaaa ttgagtctca gtttatcttg gaggttcttg aagaaagggc tgttgacgaa 6540 

agttctagaa aatacctctg cccatttgct tgcttacaaa agtgctctgt gtcctgtggc 6600 

cgaggcgtac agcagaggca tgtgggctgt cagatcggaa cacaoaaaat agccagagag 6660 
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accgagtgca acccatacac cagaccggag tcggaacgcg actgccaagg cccacggtgt 6720 

cccctctaca cttggagggc agaggaatgg caagaaacct accatggcct gctctctcca 6780 

tctccctctt tgtgtcacgc taaactcaac cctgctccga ggagcggaaa gcctcaacct 6840 

agatgtcact tcctctcaga agcctttgcc aatcacacca ccccactaaa tctgagtcag 6900 

atgctcctcc actcagctct cacaacacac gcagattact gtactctggc agttaacacc 6960 

tggaattctc attgcctgtt tttctcatct atgttatcag ttatttaa 7008 

<210> 2 

<211> 3674 

<212> DNA 

<213> Homo Sapiens 

<400> 2 

gcgggaagca ccatgcagtt tgtatcctgg gccacactgc taacgctcct ggtgogggac 60 

ctggccgaga tggggagccc agacgccgcg gcggccgtac gcaaggacag gctgcacccg 120 

aggcaagtga aattattaga gaccctgggc gaatacgaaa tcgtgtctcc catccgagtg 180 

aacgctctcg gagaaccctt tcccacgaac gtccacttca aaagaacgcg acggagcatt 240 

aactctgcca ctgacccctg gcctgccttc gcctcctcct cttcctcctc tacctcctcc 300 

caggcgcatt accgcctctc tgccttcggc cagcagtttc tatttaatct caccgccaat 360 

gccggattta tcgctccact gttcactgtc accctcctcg ggacgcccgg ggtgaatcag 420 

accaagtttt attccgaaga ggaagcggaa ctcaagcact gtttctacaa aggctatgtc 480 

aataccaact ccgagcacac ggccgtcatc agcctctgct caggaatgct gggcacattc 540 

cggtctcatg atggggatta ttttattgaa ccaotacagt ctatggatga acaagaagat 600 

gaagaggaac aaaacaaaoc ccacatcatt tataggcgoa gcgcccccca gagagagccc 660 

tcaacaggaa ggcatgcatg tgacacctca gaacacaaaa ataggcacag taaagacaag 720 

aagaaaacca gagcaagaaa atggggagaa aggattaacc tggctggtga cgtagcagca 780 

ttaaacagcg gcttagcaac agaggcattt tctgcttatg gtaataagac ggacaacaca 840 

agagaaaaga ggacccacag aaggacaaaa cgttttttat cctatccacg gtttgtagaa 900 

gtcttggtgg tggcagacaa cagaatggtt tcataccatg gagaaaacct tcaacactat 960 

attttaactt taatgtcaat tgtagcctct atctataaag acccaagtat tggaaattta 1020 

attaatattg ttattgtgaa cttaattgtg attcataatg aacaggatgg gccttccata 1080 

tcttttaatg ctcagacaac attaaaaaac ctttgccagt ggcagcattc gaagaacagt 1140 

ccaggtggaa tccatcatga tactgctgtt ctcttaacaa gacaggatat ctgcagagct 1200 

cacgacaaat gtgatacctt aggcctggct gaactgggaa ccatttgtga tccctataga 1260 
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agctgttcta ttagtgaaga tagtggattg agtacagctt ttacgatcgc ccatgagctg 1320 

ggccatgtgt ttaacatgcc tcatgatgac aacaacaaat gtaaagaaga aggagtfcaag 1380 

agtccccagc atgtcatggc tccaacactg aacttctaca ccaacccctg gatgtggtca 1440 

aagtgtagtc gaaaatatat cactgagttt etagacactg gttatggcga gtgtttgctt 1500 

aacgaacctg aatccagacc ctaccctttg cctgtccaac tgccaggcat cctttacaac 1560 

gtgaafcaaac aatgtgaatt gatttttgga ccaggttctc aggtgtgccc atatatgatg 1520 

cagtgcagac ggctctggtg caataacgtc aatggagtac acaaaggctg ccggactcag 1680 

cacacaccct gggccgatgg gacggagtgc gagcctggaa agcactgcaa gtatggattt 1740 

tgtgttccca aagaaatgga tgtccccgtg acagatggat cctggggaag ttggagtccc 1800 

tttggaacct gctccagaac atgtggaggg ggcatcaaaa cagccattcg agagtgcaac 1860 

agaccagaac caaaaaatgg tggaaaatac tgtgtaggac gtagaatgaa atttaagtcc 1920 

tgcaacacgg agccatgtct caagcagaag cgagacttcc gagatgaaca gtgtgctcac 1980 

tttgacggga agcattttaa catcaacggt ctgcttccca atgtgcgctg ggtccctaaa 2 040 

tacagtggaa ttctgatgaa ggaccggtgc aagttgttct gcagagtggc agggaacaca 2100 

gcctactatc agcttcgaga cagagtgata gatggaactc cttgtggcca ggacacaaat 2160 

gatatctgtg tccagggcct ttgccggcaa gctggatgcg atcatgtttt aaactcaaaa 2220 

gcccggagag ataaatgtgg ggtttgtggt ggcgataatt cttcatgcaa aacagtggca 2280 

ggaacattta atacagtaca ttatggttac aatactgtgg tccgaattcc agctggtgct 2340 

accaatattg atgtgcggca gcacagtttc tcaggggaaa cagacgatga caactactta 2400 

gctttatcaa gcagtaaagg tgaattettg ctaaatggaa actttgttgt cacaatggcc 24 60 

aaaagggaaa ttcgcattgg gaatgctgtg gtagagtaca gtgggtccga gactgccgta 2520 

gaaagaatta actcaacaga tcgcattgag caagaacttt tgcttcaggt tttgtcggtg 2580 

ggaaagttgt acaaccccga tgtacgctat tctttcaata ttccaattga agataaacct 264 0 

cagcagtttt actggaacag tcatgggcca tggcaagcat gcagtaaacc ctgccaaggg 2700 

gaacggaaac gaaaacttgt ttgcaccagg gaatctgato agcttactgt ttctgatcaa 2760 

agatgcgatc ggctgcccca gcctggacac attactgaac cctgfcggtac agactgtgac 2820 

ctgaggtggc atgttgccag caggagtgaa tgtagtgccc agtgtggctt gggttaccgc 2880 

acattggaca tctactgtgc caaatatagc aggctggatg ggaagactga gaaggttgat 2940 

gatggttttt gcagcagcca tcccaaacca agcaaccgtg aaaaatgctc aggggaatgt 3000 

aacacgggtg gctggcgcta ttctgcctgg actgaatgtt caaaaagctg tgacggtggg 3060 

acccagagga gaagggctat ttgtgtcaat acccgaaatg atgtactgga tgacagcaaa 3120 
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tgcacacatc aagagaaagt taccattcag aggtgcagtg agttcccttg tccacagtgg 3180 

aaatctggag actggtcaga ggtaagatgg gagggctgtt atttccccta ggtcatctct 324 0 

tacattctag ttctggtgct ctctatctgt ttaagacaaa cccttgtgca cctttctccc 3300 

acctctccct ttctcccttg tctcccttga gaaaacaact ccagttctct gcctgcacca 3360 

tgactgtcgt actggatgta actagtctac cagtgacctc agggcacttt gggcttggct 3420 

agatcactca ctgttgtagc ttctgttgtg attttgaagt tgcagtccat caccttccct 3480 

cctctttgag ccctagctaa gtcactgaaa ggaaatcatg gatttattaa tcataaagct 3S40 

atactagctc acatctgaag tcaacatgaa gtttcctact tccttgtctt tgaaataaga 3600 

gaattagacc ccagggagtg acctctctga cttacccatc caactgccca aaaaaaaaaa 3660 
aaaaaaaaaa aaaa 



3674 



<210> 3 

<211> 5830 

<212> DNA 

<213> Homo Sapiens 

<400> 3 

actgagtccc gggaccccgg gagagcggtc agtgtgtggt cgctgcgttt cctctgcctg 60 

cgccgggcat cacttgcgcg ccgcagaaag tccgtctggc agcctggata tcctctccta 120 

ccggcacccg cagacgcccc tgcagccgcc ggtcggcgcc cgggctccct agccctgtgc 180 

gctcaactgt cctgcgctgc ggggtgccgc gagttccacc tccgcgcctc cttctctaga 24 0 

caggcgctgg gagaaagaac cggctcccga gttctgggca tttcgcccgg ctcgaggtgc 3 00 

aggatgcaga gcaaggtgct gctggccgtc gccctgtggc tctgcgtgga gacccgggcc 360 

gcctctgtgg gtttgcctag tgtttctctt gatotgccca ggctcagcat acaaaaagac 420 

atacttacaa ttaaggctaa tacaactctt caaattactt gcaggggaca gagggacttg 480 

gactggcttt ggcccaataa tcagagtggc agtgagcaaa gggtggaggt gactgagtgc 540 

agcgatggcc tcttctgtaa gacactcaca attccaaaag tgatcggaaa tgacactgga 600 

gcctacaagt gcttctaccg ggaaactgac ttggcctcgg tcatttatgt ctatgttcaa 660 

gattacagat ctccatttat tgcttctgtt agtgaccaac atggagtcgt gtacattact 720 

gagaacaaaa acaaaactgt ggtgattcca tgtctcgggt ccatttcaaa tctcaacgtg 780 

tcactttgtg caagataccc agaaaagaga tttgttcctg atggtaacag aatttcctgg 840 

gacagcaaga agggctttac tattcccagc tacatgatca gctatgctgg catggtcttc 900 

tgtgaagcaa aaattaatga tgaaagttac cagtctatta tgtacatagt tgtcgttgta 960 

gggtatagga tttatgatgt ggttctgagt ccgtctcatg gaattgaact atctgttgga 1020 
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gaaaagcttg tcttaaattg 


tacagcaaga 


actgaactaa 


atgtggggat 


tgacttcaac 


1080 


tgggaatacc cttcttcgaa 


gcatcagcat 


aagaaacttg 


taaaccgaga 


cctaaaaacc 


1140 


cagtctggga gtgagatgaa 


gaaatttttg 


agcaccttaa 


ctatagatgg 


tgtaacccgg 


1200 


agtgaccaag gattgtacac 


ctgtgcagca 


tccagtgggc 


tgatgaccaa 


gaagaacagc 


1260 


acatttgtca gggtccatga 


aaaacctttt 


gttgcttttg 


gaagtggcat 


ggaatctctg 


1320 


gtggaagcca cggtggggga 


gcgtgtcaga 


atccctgcga 


agtaccttgg 


ttacccaccc 


1380 


ccagaaataa aatggtataa 


aaatggaata 


ccccttgagt 


ccaatcacac 


aattaaagcg 


1440 


gggcatgtac • tgacgattat 


ggaagtgagt 


gaaagagaca 


caggaaatta 


cactgtcatc 


1500 


cttaccaatc ccatttcaaa 


ggagaagcag 


agccatgtgg 


tctctctggt 


tgtgtatgtc 


1560 


ccaccccaga ttggtgagaa 


atctctaatc 


tctcctgtgg 


attcctacca 


gtacggcacc 


1620 


actcaaacgc tgacatgtac 


ggtctatgcc 


attcctcccc 


cgcatcacat 


ccactggtat 


1680 


tggcagttgg aggaagagtg 


cgccaacgag 


cccagccaag 


ctgtctcagt 


gacaaaccca 


1740 


tacccttgtg aagaatggag 


aagtgtggag 


gacttccagg 


gaggaaataa 


aattgaagtt 


1800 


aataaaaatc aatttgctct 


aattgaagga 


aaaaacaaaa 


ctgtaagtac 


ccttgttatc 


1860 


caagcggcaa atgtgtcagc 


tttgtacaaa 


tgtgaagcgg 


tcaacaaagt 


cgggagagga 


1920 


gagagggtga tctccttcca 


cgtgaccagg 


ggtcctgaaa 


ttactttgca 


acctgacatg 


1980 


cagcccactg agcaggagag 


cgtgtctttg 


tggtgcactg 


cagacagatc 


tacgtttgag 


2040 


aacctcacat ggtacaagct 


tggcccacag 


cctctgccaa 


tccatgtggg 


agagttgccc 


2100 


acacctgttt gcaagaactt 


ggatactctt 


tggaaattga 


atgccaccat 


gttctctaat 


2160 


agcacaaatg acattttgat 


catggagctt 


aagaatgcat 


ccttgcagga 


ccaaggagac 


2220 


tatgtctgcc ttgctcaaga 


caggaagacc 


aagaaaagac 


attgcgtggt 


caggcagctc 


22B0 


acagtcctag agcgtgtggc 


acccacgatc 


acaggaaacc 


tggagaatca 


gacgacaagt 


2340 


attggggaaa gcatcgaagt 


ctcatgcacg 


gcatctggga 


atccccctcc 


acagatcatg 


2400 


tggtttaaag ataatgagac 


ccttgtagaa 


gactcaggca 


ttgtattgaa 


ggatgggaac 


2460 


cggaacctca ctatccgcag 


agtgaggaag 


gaggacgaag 


gcctctacac 


ctgccaggca 


2520 


tgcagtgttc ttggctgtgc 


aaaagtggag 


gcatttttca 


taatagaagg 


tgcccaggaa 


2580 


aagacgaact tggaaatcat 


tattctagta 


ggcacggcgg 


tgattgccat 


gttcttctgg 


2640 


ctacttcttg tcatcatcct 


acggaccgtt 


aagcgggcca 


atggagggga 


actgaagaca 


2700 


ggctacttgt ccatcgtcat 


ggatccagat 


gaactcccat 


tggatgaaca 


ttgtgaacga 


2760 


ctgccttatg atgccagcaa 


atgggaattc 


cccagagacc 


ggctgaagct 


aggtaagcct 


2820 
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cttggccgtg gtgcctttgg ccaagtgatt gaagcagatg cctttggaat tgacaagaca 2880 
gcaacttgca ggacagtagc agtcaaaatg ttgaaagaag gagcaacaca cagtgagcat 2940 
cgagctctca tgtctgaact caagatcctc attcatattg gtcaccatct caatgtggtc 3000 
aaccttctag gtgcctgtac caagccagga gggccactca tggtgattgt ggaattctgc 3060 
aaatttggaa acctgtccac ttacctgagg agcaagagaa atgaatttgt cccctacaag 3120 
accaaagggg cacgattccg tcaagggaaa gactacgttg gagcaatccc tgtggatctg 3180 
aaacggcgct tggacagcat caccagtagc cagagctcag ccagctctgg atttgtggag 324 0 
gagaagtccc tcagtgatgt agaagaagag gaagctcctg aagatctgta taaggacttc 3300 
ctgaccttgg agcatctcat ctgttacagc ttccaagtgg ctaagggcat ggagttcttg 3360 
gcatcgcgaa agtgtatcca cagggacctg gcggcacgaa atatcctctt atcggagaag 3420 
aacgtggtta aaatctgtga ctttggcttg gcccgggata tttataaaga tccagattat 3480 
gtcagaaaag gagatgctcg cctccctttg aaatggatgg ccccagaaac aatttttgac 3540 
agagtgtaca caatccagag tgacgtctgg tcttttggtg ttttgctgtg ggaaatattt 3600 
tccttaggtg cttctccata tcctggggta aagattgatg aagaattttg taggcgattg 3660 
aaagaaggaa ctagaatgag ggcccctgat tatactacac cagaaatgta ccagaccatg 3720 

ctggactgct ggcacgggga gcccagtcag agacccacgt tttcagagtt ggtggaacat 3780 

ttgggaaatc tcttgcaagc taatgctcag caggatggca aagactacat tgttcttccg 3840 

atatcagaga ctttgagcat ggaagaggat tctggactct ctctgcctac ctcacctgtt 3900 

tcctgtatgg aggaggagga agtatgtgac cccaaattcc attatgacaa cacagcagga 3960 

atcagtcagt atctgcagaa cagtaagcga aagagccggc ctgtgagtgt aaaaacattt 4020 

gaagatatcc cgttagaaga accagaagta aaagtaatcc cagatgacaa ccagacggac 4080 

agtggtatgg ttcttgcctc agaagagctg aaaactttgg aagacagaac caaattatct 4140 

coatcttttg gtggaatggt gcccagcaaa agcagggagt ctgtggcatc tgaaggctca 4200 

aaccagacaa gcggctacca gtccggatat cactccgatg acacagacac caccgtgtac 4260 

tccagtgagg aagcagaact tttaaagctg atagagattg gagtgcaaac cggtagcaca 4320 

gcccagattc tccagcctga ctcggggacc acactgagct ctcotcctgt ttaaaaggaa 4380 

gcatccacac cccaactccc ggacatcaca tgagaggtct gctcagattt tgaagtgttg 4440 

ttctttccac cagcaggaag tagccgcatt tgattttcat ttcgacaaca gaaaaaggac 4500 

ctcggactgc agggagccag tcttctaggc atatcctgga agaggcttgt gacccaagaa 4560 

tgtgtctgtg tcttctccca gtgttgacct gatcctcttt tttcattcat ttaaaaagca 4620 

.ttatcatgcc cctgctgcgg gtctcaccat gggtttagaa caaagagctt caagcaatgg 4680 
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ccccatcctc 


aaagaagtag 


cagtacctgg 


ggagctgaca 


cttctgtaaa 


actagaagat 


4740 


aaaccaggca 


acgtaagtgt 


tcgaggtgtt 


gaagatggga 


aggatttgca 


gggctgagtc 


4800 


tatccaagag 


gctttgttta 


ggacgtgggt 


cccaagccaa 


gccttaagtg 


tggaattcgg 


4860 


attgatagaa 


aggaagacta 


acgttacctt 


gctttggaga 


gtactggagc 


ctgcaaatgc 


4920 


attgtgtttg 


ctctggtgga 


ggtgggcatg 


gggtctgttc 


tgaaatgtaa 


agggttcaga 


4980 


c 9999tttct 


ggttttagaa 


ggttgcgtgt 


tcttcgagtt 


gggctaaagt 


agagttcgtt 


5040 


gtgctgtttc 


tgactcctaa 


tgagagttcc 


ttccagaccg 


ttagctgtct 


ccttgccaag 


5100 


ccccaggaag 


aaaatgatgc 


agctctggct 


ccttgtctcc 


caggctgatc 


ctttattcag 


5160 


aataccacaa 


agaaaggaca 


ttcagctcaa 


ggctccctgc 


cgtgttgaag 


agttctgact 


5220 


gcacaaacca 


gcttctggtt 


tcttctggaa 


tgaataccct 


catatctgtc 


ctgatgtgat 


5280 


atgtctgaga 


ctgaatgcgg 


gaggttcaat 


gtgaagctgt 


gtgtggtgtc 


aaagtttcag 


5340 


gaaggatttt 


acccttttgt 


tcttccccct 


gtccccaacc 


cactctcacc 


ccgcaaccca 


5400 


tcagtatttt 


agttatttgg 


cctctactcc 


agtaaacctg 


attgggtttg 


ttcactctct 


5460 


gaatgattat 


tagccagact 


tcaaaattat 


tttatagccc 


aaattataac 


atctattgta 


5520 


ttatttagac 


ttttaacata 


tagagctatt 


tctactgatt 


tttgcccttg 


ttctgtcctt 


5580 


tttttcaaaa 


aagaaaatgt 


gttttttgtt 


tggtaccata 


gtgtgaaatg 


ctgggaacaa 


5640 


tgactataag 


acatgctatg 


gcacatatat 


ttatagtctg 


tttatgtaga 


aacaaatgta 


5700 


atatattaaa 


gccttatata 


taatgaactt 


tgtactattc 


acattttgta 


tcagtattat 


5760 


gtagcataac 


aaaggtcata 


atgctttcag 


caattgatgt 


cattttatta 


aagaacattg 


5820 


aaaaacttga 












5830 


<210> 4 

<211> 3334 

<212> DNA 

<213> Homo Sapiens 












<400> 4 

gcacgagccc cgggctgccg 


gcgcgggcgc 


cgcggcacgt 


ccacaggctg 


ggtcgcgagg 


60 


tggcgatcgc tgagaggcag 


gagggccgag 


gcgggcctgg 


gaggcggccc 


ggaggtgggg 


120 


cgccgctggg gccggcccgc 






pap rjrj CCCClCl 




t no 


gcggactggc 


ctcccaagcg 


tggggcgaca 


agctgccgga 


gctgcaatgg 


gccgcggctg 


240 


gggattcttg tttggcctcc 


tgggcgccgt 


gtggctgctc 


agctcgggcc 


aeggagagga 


300 


gcagcccccg 


gagacagcgg 


cacagaggtg 


cttctgccag 


gttagtggtt 


acttggatga 


360 


ttgtacctgt gatgttgaaa 


ccattgatag 


atttaataac 


tacaggcttt 


tcccaagact 


420 
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acaaaaactt cttgaaagtg actactttag gtattacaag gtaaacctga agaggccgtg 480 

tcctttctgg aatgacatca gccagtgtgg aagaagggac tgtgctgtca aaccatgtca 540 

atctgatgaa gttcctgatg gaattaaatc tgcgagctac aagtattctg aagaagccaa 600 

taatctcatt gaagaatgtg aacaagctga acgacttgga gcagtggatg aatctctgag 660 

tgaggaaaca cagaaggctg ttcttcagtg gaccaagcat gatgattctt cagataactt 720 

ctgtgaagct gatgacattc agtcccctga agctgaatat gtagatttgc ttcttaatcc 78 0 

tgagcgctac actggttaca agggaccaga tgcttggaaa atatggaatg tcatctacga 840 

agaaaactgt tttaagccac agacaattaa aagaccttta aatcctttgg cttctggtca 900 

agggacaagt gaagagaaca ctttttacag ttggctagaa ggtctctgtg' tagaaaaaag 960 

agcattctac agacttatat ctggcctaca tgcaagcatt aatgtgcatt tgagtgcaag 1020 

atatctttta caagagacct ggttagaaaa gaaatgggga cacaacatta cagaatttca 1080 

acagcgattt gatggaattt tgactgaagg agaaggtcca agaaggctta agaacttgta 114 0 

ttttctctac ttaatagaac taagggcttt atccaaagtg ttaccattct tcgagcgccc 1200 

agattttcaa ctctttactg gaaataaaat tcaggatgag gaaaacaaaa tgttacttct 1260 

ggaaatactt catgaaatca agtcatttcc tttgcatttt gatgagaatt cattttttgc 1320 

tggggataaa aaagaagcac acaaactaaa ggaggacttt cgactgcatt ttagaaatat 1380 

ttcaagaatt atggattgtg ttggttgttt taaatgtcgt ctgtggggaa agcttcagac 144 0 

tcagggtttg ggcactgctc tgaagatctt attttctgag aaattgatag caaatatgcc 1500 

agaaagtgga cctagttatg aattccatct aaccagacaa gaaatagtat cattattcaa 1560 

cgcatttgga agaatttcta caagtgtgaa agaattagaa aacttcagga acttgttaca 1620 

gaatattcat taaagaaaac aagctgatat gtgcctgttt ctggacaatg gaggcgaaag 1680 

agtggaattt cattcaaagg cataatagca atgacagtct taagccaaac attttatata 1740 

aagttgcttt tgtaaaggag aattatattg ttttaagtaa acacattttt aaaaattgtg 1800 

ttaagtctat gtataatact actgtgagta aaagtaatac tttaataatg tggtacaaat i860 

tttaaagttt aatattgaat aaaaggagga ttatcaaatt catatatgat aaaagtgaat 1920 

gttctaagtc tctcaaacta gcgttttatg taataatatg taatataaat aaaactatgg 1980 

taaatgtgac aagcatttaa taggaaaatg ctaaggaggc ctcataaatg acccataatt 2040 

accaacgtag aatttttcag tacatttagg gttgctggat ttagcaaata aaaataaaga 2100 

ttgcccagtt agatttgaat ttcagataaa caattagttt tttaatattt tacatggaat 2160 

atttggaaaa tacttatact aaaaaattat ttgtttgaaa ttcacattta actgggagtc 2220 
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ttgtatttta tctggcaatc ctaaaataca ttggtatgaa acaaatcact tttagaagta 2280 

tattgctatt ttgattgggt tgtttttgtg tgtagaaacg tacaataaca actcaaaggc 2340 

acaggagatt tctaaacatt gtgaaaagtt gaatagatta tatatttatt ctcataatac 2400 

tttcactaat actaaataaa atttggggaa cactttttat ttttatataa tttccaattt 2460 

acagaaaagt ttcaaaaata gtacaaagag ctctcttacc cagattcact aattgttcat 2520 

acgtgcttta tctttcatgc tttctctgta cacacacaca cacacacaaa tttttcctca 2580 

atcatttgaa agtcagttat aggcatcatg ccccttaaac cctaaatact tcagtgtgta 2640 

atactgaata attactaaaa atgattttct cagaaaaaaa aactcccaca attctggaac 2700 

tataatactg taagccttag aataaataat actttcaagt tccaatctaa agttcttttt 2760 

gagttttgtt gcccgtttta tgcttgatgt gtatagtaat agggtaggct atttatttta 2820 

ttaaaatttt ttttagagac aaggttttgc tgtgttgccc aagctggaac ttgaacgact 2880 

gggctgaagt gatcttccca cctcagcctc ccaagtagct gggaatacag gtgtctgcca 2940 

ccatacccag tttcattttt gttttttata cccgaagttc atttcctttg tctccctaaa 3000 

actgaactgt aattttggga ggttttcatt agtggaagct cttcatttat aaagctattt 3060 

gaaggggttt aggaatttat atcacatggt aattgtagag aaaaagaagc tatatacctc 3120 

aaaatcgtgc cctctttaca tatgtcttat caggtataac atgttgaaat gtcacattag 3180 

tagtaaagtg gggtttattt atatagtggt taagaaatgt cagtttacac tgctgtatac 3240 

ttcttcttct gtgtccctaa ggcctggtac agtgccaagc acatacttgg tatccaataa 3300 

atatttgttg gatgaaaaaa aaaaaaaaaa aaaa 3 334 

<210> 5 

<211> 840 

<212> DNA 

<213> Homo Sapiens 

<400> 5 

ccacgtccgg ggtgccgagc caactttcct gcgtccatgc agccccgccg gcaacggctg 60 

cccgctccct ggtccgggcc caggggcccg cgccccaccg ccccgctgct cgcgctgctg 120 

ctgttgctcg ccccggtggc ggcgcccgcg gggtccgggg gccccgacga ccctgggcag 180 

cctcaggatg ctggggtccc gcgcaggctc ctgcagcaga aggcgcgcgc ggcgcttcac 240 

ttcttcaact tccggtccgg ctcgcccagc gcgctgcgag tgctggccga ggtgcaggag 300 

ggccgcgcgt ggattaatcc aaaagaggga tgtaaagttc acgtggtctt cagcacagag 360 

cgctacaacc cagagtcttt acttcaggaa ggtgagggac gtttggggaa atgttctgct 420 

cgagtgtttt tcaagaatca gaaacccaga ccaaccatca atgtaacttg tacacggctc 480 
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atcgagaaaa agaaaagaca acaagaggat tacctgcfctt acaagcaaat gaagcaactg 540 

aaaaacccct tggaaatagt cagcatacct gataatcatg gacatattga tccctctctg 600 

agactcatct gggatttggc tttccttgga agctcttacg tgatgtggga aatgacaaca 660 

caggtgtcac actactactt ggcacagctc actagtgtga ggcagtgggt aagaaaaacc 72 0 

tgaaaattaa cttgtgccac aagagttaca atcaaagtgg tctccttaga ctgaattcat 7 80 

gtgaacttct aatttcatat caagagttgt aatcacattt atttcaataa atatgtgagt 840 

<210> 6 

<211> 3314 

<212> DNA 

<213> Homo Sapiens 

<400> 6 

ggaggcaggc ggtgccgcgg cgccgggacc cgactcatcc ggtgcttgcg tgtggtggtg 60 

agcgcagcgc cgaggatgag gaggtgcaac agcggctccg ggccgccgcc gtcgctgctg 120 

ctgctgctgc tgtggctgct cgcggfctccc ggcgctaacg cggccccgcg gtcggcgctc 180 

tattcgcctt ccgacccgct gacgctgctg caggcggaca cggtgcgcgg cgcggtgctg 240 

ggctcccgca gcgcctgggc cgtggagttc ttcgcctcct ggtgcggcca ctgcatcgcc 300 

ttcgccccga cgtggaaggc gctggccgaa gacgtcaaag cctggaggcc ggccctgtat 360 

ctcgccgccc tggactgtgc tgaggagacc aacagtgcag tctgcagaga cttcaacatc 420 

cctggcttcc cgactgtgag gttcttcaag gcctttacca agaacggctc gggagcagta 480 

tttccagtgg ctggtgctga cgtgcagacg ctgcgggaga ggctcattga cgccctggag 540 

tcccatcatg acacgtggcc cccagcctgt cccccactgg agcctgccaa gctggaggag 600 

attgatggat tctttgcgag aaataacgaa gagtacctgg ctctgatctt tgaaaaggga 660 

ggctcctacc tgggfcagaga ggtggctctg gacctgtccc agcacaaagg cgtggcggtg 720 

cgcagggtgc tgaacacaga ggccaatgtg gtgagaaagt ttggtgtcac cgacttcccc 780 

tcttgctacc tgctgttccg gaatggctct gtctcccgag tccccgtgct catggaatcc 84 0 

aggtccttct ataccgctta cctgcagaga ctctctgggc tcaccaggga ggctgcccag ■ 900 

accacagttg caccaaccac tgctaacaag atagctccca ctgtttggaa attggcagat 960 

cgctccaaga tctacatggc tgacctggaa tctgcactgc actacatcct gcggatagaa 1020 

gtgggcaggt tcccggtcct ggaagggcag cgcctggtgg ccctgaaaaa gtttgtggca 1080 

gtgctggcca agtatttccc tggccggccc ttagtccaga acttcctgca ctccgtgaat 114 0 

gaatggctca agaggcagaa gagaaataaa attccctaca gtttctttaa aactgccctg 12 00 

gacgacagga aagagggtgc cgttcttgcc aagaaggtga actggattgg ctgccagggg 1260 



WO 2004/073657 PCTAJS2004/005455 

14/282 

agtgagccgc atttccgggg ctttccctgc tccctgtggg tcctcttcca cttcttgact 1320 

gtgcaggcag ctcggcaaaa tgtagaccac tcacaggaag cagccaaggc caaggaggtc 13 80 

ctcccagcca tccgaggcta cgtgcactac ttcttcggct gccgagactg cgctagccac 1440 

ttcgagcaga tggctgctgc ctccatgcac cgggtgggga gtcccaacgc cgctgtcctc 1500 

tggctctggt ctagccacaa cagggtcaat gctcgccttg caggtgcccc cagcgaggac 1560 

ccccagttcc ccaaggtgca gtggccaccc cgtgaacttt gttctgcctg ccacaatgaa 162 0 

cgcctggatg tgcccgtgtg ggacgtggaa gccaccctca acttcctcaa ggcccacttc 1680 

tccccaagca acatcatcct ggacttccct gcagctgggt cagctgcccg gagggatgtg 1740 

cagaatgtgg cagccgcccc agagctggcg atgggagccc tggagctgga aagccggaat 1800 

tcaactctgg accctgggaa gcctgagatg atgaagtccc ccacaaacac caccccacat i860 

gtgccggctg agggacctga ggcaagtcga cccccgaagc tgcaccctgg cctcagagct 192 0 

gcaccaggcc aggagcctcc tgagcacatg gcagagcttc agaggaatga gcaggagcag 1980 

ccgcttgggc agtggcactt gagcaagcga gacacagggg ctgcattgct ggctgagt cc 2040 

agggctgaga agaaccgcct ctggggccct ttggaggtca ggcgcgtggg ccgcagctcc 2100 

aagcagctgg tcgacatccc tgagggccag ctggaggccc gagctggacg gggccgaggc 2160 

cagtggctgc aggtgctggg agggggcttc tcttacctgg acatcagcct ctgtgtgggg 222 0 

ctctattccc tgtccttcat gggcctgctg gccatgtaca cctacttcca ggccaagata 2280 

agggccctga agggccatgc tggccaccct gcagcctgaa ccacctgggg aggaggcggg 2340 

agagggagct gccatctcta ggcacctcaa gccccctgac cccattccct cccctcccac 2400 

cccttgctcc ttgtctggcc tagaagtgtg ggaaattcag gaaaacgagt tgctccagtg 2460 

aagcttcttg gggttgctag gacagagagc tcctttgaca caaaagacag gagcagggtc 2520 

caggttcccc tgctgtgcag ggagggcagc cccgggcagt gggcataggg cagctcagtc 2 580 

cctggcctct tagcaccaca ttcctgtttt tcagcttatt tgaagtcctg cctcattctc 2640 

actggagcct cagtctctcc tgcttggtct tggccctcaa ctggggcaag tgaagccaga 2700 

ggagggtccc ccagctgggt gggctggaat ggaactcctc actagctgct ggggctccgc 2760 

ccaccctgct cccttccgga caatgaagaa gcctttgcac cctgggagga aggaccaccc 2820 

cgggccctct atgcctggcc agcctccagc tcctcagacc tcctgggtgg ggtttggctt 2880 

cagggtgggg tttggaagct tctggaagtc gtgctggtct cccaggtgag gcaagccatg 2940 

gttgctgggc tgtagggtga gtggcttgct tggtgggacc tgacgagttg gtggcatggg 3000 

aaggatgtgg gtctctagtg ccttgccctg gcttagctgc aggagaagat ggctgctttc 3060 

acttcccccc attgagctct gctccctctg agcctggtct tttgtccttt tttattttgg 3120 
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tctccaagat gaatgctcat ctttggaggg tgccaggtag aagctaggga ggggagtgtc 3180 

ttctctctcc aggtttcacc ttccagtgtg cagaagttag aagggtctgg cgggggcagt 3240 

gccttacaca tgcttgattc ccacgctacc ccctgccttg ggaggtgtgt ggaataaatt 3300 

atttttgtta aggc 3314 

<210> 7 

<211> 4020 

<212> DNA 

<213> Homo Sapiens 

<400> 7 

ggcacgaggg tggagccgag cggtgcggag cagatctggt ggttctccgg agagcagctt 60 

ccttgggtgt tacatgagcc aagccctcac tgtacagaag agtgagagct gaaacctgtt 120 

ccctgagctg atcagaagga catcccttgg cccctccatc tgggctcctg tggataggag 180 

gggctgggtg agcaggccag ctgggctatg gtgtggtgcc tcggcctggc cgtcctcagc 240 

ctggtcatca gccagggggc tgacggtcga gggaagcctg aggtggtatc ggtggtgggc 300 

cgggctgagg agagtgtggt gctgggctgt gacctgctgc ccccggccgg ccggcccccc 360 

ctgcatgtca tcgagtggct gcgcttfcgga ttcctgcttc ccatcttcat ccagttcggc 42 0 

ctctactctc cccgaattga ccctgattac gtgggacgag tccggctgca gaagggggcc 480 

tctctccaga ttgagggtct ccgggtggaa gaccagggct ggtacgagtg ccgcgtgttc 540 

ttcctggacc agcacatccc tgaagacgat tttgctaacg gctcctgggt gcatctgaca 600 

gtcaattcac cccctcaatt ccaggagaca cctcctgctg tgttggaagt gcaggaactg 660 

gagcctgtga ccctgcgttg tgtggcccgt ggcagccccc tgcctcatgt gacgtggaag 720 

ctccgaggaa aggaccttgg ccagggccag ggccaggtgc aagtgcagaa cgggacgctg 780 

cggatccgcc gggtagagcg aggcagctct ggggtctaca cctgccaagc ctccagcact 840 

gagggcagcg ccacccacgc cacccagctg ctagtgctag gacccccagt catcgtggtg 900 

ccccccaaga acagcacagt caatgcctcc caggatgttt cattggcctg ccatgctgag 960 

gcataccctg ctaacctcac ctacagctgg ttccaggaca acatcaatgt cttccacatt 1020 

agccgcctgc agccccgggt gcagatcctg gtggacggga gcctgcggct gctggccacc 1080 

cagcctgatg atgccggctg ctacacctgt gtgcccagca atggcctcct gcatccaccc 1140 

tcagcctctg cctacctcac tgtgctccgc atgccggggg tgatccgctg cccggttcgt 1200 

gccaaccccc cactgctctt tgtcagctgg accaaggatg gaaaggccct gcagctggac 1260 

aagttccctg gctggtccca gggcacagaa ggctcactga tcatcgccct ggggaacgag 1320 

gatgccctgg gagaatactc ctgcaccccc tacaacagtc ttggtaccgc cgggccctct 1380 
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cctgtgaccc gcgtgctgct caaggctccc ccagctttta tagagcggcc caaggaagaa 1440 

tatttccaag aagtagggcg ggagctgctc atcccctgct ccgcccaagg ggaccctcct 1500 

cctgttgtct cttggaccaa ggtgggccgg gggctgcaag gccaggccca ggtggacagc 1560 

aacagcagcc tcatcctgcg accattgacc aaggaggccc acgggcactg ggaatgcagt 1620 

gccagcaatg ctgtggcccg agtggccacc tccacgaacg tctacgtgct gggcactagc 1680 

cctcatgttg tcaccaatgt gtccgtggtg gctttgccca agggtgccaa tgtctcctgg 1740 

gagcctggct ttgatggtgg ttatctgcag agattcagtg tctggtacac cccactggcc 1800 

aagcgtcctg accgaatgca ccatgactgg gtgtccttgg cagtgcctgt gggggctgct i860 

cacctcctag tgccagggct gcagccccac acccagtacc agttcagcgt gctagctcag 1920 

aacaagctgg ggagtggtcc cttcagcgaa atcgtcttgt ctgctccgga agggcttcct 1980 

accacgccag ctgcacccgg gcttccccca acagagatac cgcctcccct gtcccctccg 2 040 

cggggtctgg tggcagtgag gacaccccgg ggggtactcc tgcattggga tcccccagag 2100 

ctggtcccta agagactgga tggctacgtc ttggaaggcc ggcaaggctc ccagggctgg 2160 

gaggtgctgg acccggctgt ggcaggcaca gaaacagagc tgctggtgcc aggcctcatc 2220 

aaggatgttc tctacgagtt ccgcctcgtg gccttcgcgg gcagcttcgt cagcgacccc 22 80 

agcaacacgg ccaacgtctc cacttccggt ctggaggtct acccttcgcg cacgcagctg 2340 

ccgggcctcc tgcctcagcc cgtgctggcc ggcgtggtgg gcggagtctg ctttctggga 2400 

gtggccgtcc ttgtgagcat cctggccggc tgcctcctga accggcgcag ggctgcccgc 2460 

cgccgccgca agcgcctccg ccaagatcca cctcttatct tctctccgac cgggaagtca 2520 

gctgcaccct ctgctctggg ctcaggcagt cctgacagcg tggcgaagct gaagctccag 2580 

ggatccccag tccccagcct gcgccagagt ctgctctggg gggatcctgc cggaactccc 2640 

agcccccacc cggatcctcc atctagccgg ggacccttac ctctggagcc catttgccgg 2700 

ggcccagacg ggcgctttgt gatggggccc actgtggcgg ccccccagga aaggtcaggc 2760 

cgggagcagg cagaacctcg gactccagcc cagcgtctgg cccggtcctt tgactgtagc 2820 

agcagcagcc ccagtggggc accccagccc ctctgcattg aagacatcag ccctgtggca 2 880 

ccccctccag cagccccacc cagtcccttg ccaggtcctg gacccctgct ccagtacctg 2940 

agcctgccct tcttccgaga gatgaatgtg gatggggact ggcccccgct tgaggagccc 3 0 00 

agccctgctg cacccccaga ttacatggat acccggcgct gtcccacctc atctttcctt 3060 

cgttctccag aaacccctcc tgtatccccc agggaatcac ttcctggggc tgtggtaggg 3120 

gctggggcca ctgcagagcc cccttacaca gccctggctg actggacact gagggagcgg 3180 
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ctgctgccag gccttctccc tgctgcccct cgaggcagcc tcaccagcca gagcagcggg 3240 

cgaggcagcg cttcgttcct gcggcccccc tccacagccc cctctgcagg aggcagctac 3300 

ctcagccctg ctccaggaga caccagcagc tgggccagtg gccctgagag atggccccga 3360 

agggagcatg tggtgacagt cagcaagagg aggaacacat ctgtggacga gaactatgag 3420 

tgggactcag aattccctgg ggacatggaa ttgctggaga ctttgcacct gggcttggcc 3480 

agctcccggc tcagacctga agctgagaca gagctaggtg tgaagactcc agaggagggc 3540 

tgcctcctga acactgccca tgttactggc cctgaggccc gctgtgctgc octtcgggag 3600 

gaattcctgg ccttccgccg ccgccgagat gctactaggg ctcggctacc agcctatcga 3660 

cagccagtcc cccaccccga acaggccact ctgctgtgaa catccctaat gtgaggctgt 3720 

gaaaaggcat atggacctgc aaaggaggcc cccaaccaga cagacctagt ttcaaacgag 3780 

ggcactgcco ctgcctgccc ctttggtgcc caggcacaga ccctgatagt gggtttgggt 3840 

caccttggta tggaatgtat gtgctgaccc cctaggtgag tctggggatt ggaacaggga 3900 

tcttaggtct gcctctctct ctctctctct ctctctctct ctctctctgt gtgtgtgtgt 3960 

gtgtgtgaag ttttttacag gtgaataaac aaagtttgaa agaaaaaaaa aaaaaaaaaa 4020 

<210> 8 

<211>< / 1284 

<212> DNA 

<213> Homo Sapiens 

<400> 8 

ggcacgaggg tctccgcctg caggtgcaga catotggagg agagagtcgg agagcagaaa 60 

ccacttggct cccagacaat tcccctacag gctttgggcc tggaattgag gagaaagtga 120 

gctaagttgg ggtggggtga gtccaaagaa gcacgggctg ggccaagcta agctgctctg 180 

ggctgggctg atccctcccc actcaggggc gggacccoag gaggagggag aggacagagc 240 

cactgcagag gaccagactg ggaaaacaac gatatggcag gagccagtct tggggcocgc 300 

ttctaccggc agatoaaaag aoatccgggg atcatcccga tgatcggctt aatctgcctg 360 

ggcatgggca gcgctgcgct ttacttgctg cgactcgccc ttcgcagccc cgacgtctgc 420 

tgggacagaa agaacaaccc ggagccctgg aaccgcctga gccccaatga ccaatacaag 480 

ttccttgcag tttccactga ctataagaag ctgaagaagg accggccaga cttctaagcc 540 

aggctgggct gccagtgcca tgcaagccac agccagccag cccatccact tcttccactc 600 

ctccccgcag gccccaaggc atcactccgg ccaccctgtc ccgctactgc ttacacaggc 660 

cgggttccca cgcagagggg aggctgctcc acccctactc tcctcccttg ctcccagcag 72 0 

cggaagcgcc tctgaccctt ggcttgagtc ccacgtgggg gaggaggagg caggcagcac 78 0 
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cagcaggggt ccaccaagag cccagaccag cccctctgcc ctcctacccg ggcctcgaag 84 0 

ggtgtggcac aggctacgtg ttgagcgtgg cctacgtgag ccaacaagaa gcaggggcct 900 

ctgagtgcca agcgacgtgg cgggctccac gttagcccag gctctgagag ccagcccagg 960 

ggcggcgctg ctcagcttgg gctggtccag ggcctgccca ggctggggca cctttgcctc 102 0 

ctgaggcgca gcgcactcct cccctgccca agcctactgc ctcccgctgc cgccagtacc 1080 

ccctccagcc ccacacctgg gcctccccct gccactcccc tcccttgctc ccctctgtcc 1140 

ccagggatca aacagaagca gccgtgggca aaatacaatt tcatttaaca aattgaaaaa 1200 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1260 

aaaaaaaaaa aaaaaaaaaa aaaa 1284 

<210> 9 

<211> 4165 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> misc_feature 

<222> (4076) . . (4076) 

<223> n is a, c, g, or t 

<220> 

<221> mis cofeature 

<222> (4091) . . (4091) 

<223> n is a, c, g, or t 

<400> 9 



cgggctactt 


tgaaaggaca 


accatttttc tttccgctaa tttataatgg ttttgaagtg 


60 


gttgttcatt 


ctcaaacata 


gacttttaaa tgttaggtct ttcctataac tctttgttat 


120 


tggaagtttc 


aaggatttgg 


acactcaatt aaggattctg tcctctcctc attcctttgg 


180 


ttttggccca 


aatgattatg 


tttcctcttt ttgggaagat ttctctgggt attttgatat 


240 


ttgtcctgat 


agaaggagac 


tttccatcat taacagcaca aacctactta tctatagagg 


300 


agatccaaga 


acccaagagt 


gcagtttctt ttctcctgcc tgaagaatca acagaccttt 


360 


ctctagctac 


caaaaagaaa 


cagcctctgg accgcagaga aactgaaaga cagtggttaa 


420 


tcagaaggcg 


gagatctatt 


ctgtttccta atggagtgaa aatctgccca gatgaaagtg 


480 


ttgcagaggc 


tgtggcaaat 


catgtgaagt attttaaagt ccgagtgtgt caggaagctg 


540 


tctgggaagc 


cttcaggact 


ttttgggatc gacttcctgg gcgtgaggaa tatcattact 


600 


ggatgaattt 


gtgtgaggat 


ggagtcacaa gtatatttga aatgggcaca aattttagtg 


660 


aatctgtgga 


acatagaagc 


ttaatcatga agaaactgac ttatgcaaag gaaactgtaa 


720 


gcagctctga 


actgtcttct 


ccagttcctg ttggtgatac ttcaacattg ggagacacta 


780 
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ctctcagtgt 


tccacatcca 


gaggtggacg 


cctatgaagg 


tgcctcagag agcagcttgg 


840 


aaaggccaga 


ggagagtatt 


agcaatgaaa 


ttgagaatgt 


gatagaagaa gccacaaaac 


900 


cagcaggtga 


acagattgca 


gaattcagta 


tccacctttt 


ggggaagcag tacagggaag 


960 


aactacagga 


ttcctccagc 


tttcaccacc 


agcaccttga 


agaagaattt atttcagagg 


1020 


ttgaaaatgc 


atttactggg 


ttaccaggct 


acaaggaaat 


tcgtgtactt gaatttaggt 


1080 


cccccaagga 


aaatgacagt 


ggcgtagatg 


tttactatgc 


agttaccttc aatggtgagg 


1140 


ccatcagcaa 


taccacctgg 


gacctcatta 


gccttcactc 


caacaaggtg gaaaaccatg 


1200 


gccttgtgga 


actggatgat 


aaacccactg 


ttgtttatac 


aatcagtaac ttcagagatt 


1260 


atattgctga 


gacattgcag 


cagaattttt 


tgctggggaa 


ctcttccttg aatccagatc 


1320 


ctgattccct 


gcagcttatc 


aatgtgagag 


gagttttgcg 


tcaccaaact gaagatctag 


1380 


tttggaacac 


ccaaagttca 


agtcttcagg 


caacgccgtc 


atctattctg gataatacct 


1440 


ttcaagctgc 


atggccctca 


gcagatgaat 


ccatcaccag 


cagtattcca ccacttgatt 


1500 


tcagctctgg 


tcctccctca 


gccactggca 


gggaactctg 


gtcagaaagt cctttgggtg 


1560 


atttagtgtc 


tacacacaaa 


ttagcctttc 


cctcgaagat 


gggcctcagc tcttccccag 


1620 


aggttttaga 


ggttagcagc 


ttgactcttc 


attctgtcac 


cccggcagtg cttcagactg 


1680 


gcttgcctgt 


ggcttctgag 


gaaaggactt 


ctggatctca 


cttggtagaa gatggattag 


1740 


ccaatgttga 


agagtcagaa 


gattttcttt 


ctattgattc 


attgccttca agttcattca 


1800 


ctcaacctgt 


gccaaaagaa 


acaataccat 


ccatggaaga 


ctctgatgtg tccttaacat 


1860 


cttcaccata 


tctgacctct 


tctatacctt 


ttggcttgga 


ctccttgacc tccaaagtca 


1920 


aagaccaatt 


aaaagtgagc 


cctttcctgc 


cagatgcatc 


catggaaaaa gagttaatat 


1980 


ttgacggtgg 


tttaggttca 


gggtctgggc 


aaaaggtaga 


tctgattact tggccatgga 


2040 


gtgagacttc 


atcagagaag 


agcgccgaac 


cactgtccaa 


gccgtggctt gaaqataatQ 


2100 


attcactttt 


gccagctgag 


attgaagaca 


agaaactagt 


tttagttgac aaaatggatt 


2160 


ccacagacca 


aattagtaag 


cactcaaaat 


atgaacatga 


tgacagatcc acacactttc 


2220 


cagaggaaga 


gcctcttagt 


gggcctgctg 


tgcccatctt 


cgcagatact gcagctgaat 


2280 


i- uy t-y u^Ccu 


»~\ *"i r+ f~~* s*t /"i r~i f~% 


aagcacatat 


cagaagtacc 


tggtgttgat gattgctcag 


2340 


ttaccaaagc 


acctcttata 


ctgacatctg 


tagcaatctc 


tgcctctact gataaatcag 


2400 


atcaggcaga 


tgccatccta 


agggaggata 


tggaacaaat 


tactgagtca tccaactatg 


2460 


aatggtttga 


cagtgaggtt 


tcaatggtaa 


agccagatat 


gcaaactttg tggactatat 


2520 


tgccagaatc 


agagagagtt 


tggacaagaa 


cttcttccct 


agagaaattg tccagagaca 


2580 
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tattggcaag 


tacaccacag 


agtgctgaca 


ggctctggtt 


atctgtgaca 


cagtctacca 


2640 


aattgcctcc 


aaccacaatc 


tccaccctgc 


tagaggatga 


agtaattatg 


ggtgtacagg 


2700 


atatttcgtt 


agaactggac 


cggataggca 


cagattacta 


tcagcctgag 


caagtccaag 


2760 


agcaaaatgg 


caaggttggt 


agttatgtgg 


aaatgtcaac 


aagtgttcac 


tccacagaga 


2820 


tggttagtgt 


ggcttggccc 


acagaaggag 


gagatgactt 


gagttatacc 


cagacttcag 


2880 


gagctttggt 


ggttttcttc 


agcctccgag 


tgactaacat 


gatgttttca 


gaagatctgt 


2940 


ttaataaaaa 


ctccttggag 


tataaagccc 


tggagcaaag 


attcttagaa 


ttgctggttc 


3000 


cctatctcca 


gtcaaatctc 


acggggttcc 


agaacttaga 


aatcctcaac 


ttcagaaatg 


3060 


gcagcattgt 


ggtgaacagt 


cgaatgaagt 


ttgccaattc 


tgtccctcct 


aacgtcaaca 


3120 


atgcggtgta 


catgattctg 


gaagactttt 


gtaccactgc 


ctacaatacc 


atgaacttgg 


3180 


ctattgataa 


atactctctt 


gatgtggaat 


caggtgatga 


agccaaccct 


tgcaagtttc 


3240 


aggcctgtaa 


tgaattttca 


gagtgtctgg 


tcaacccctg 


gagtggagaa 


gcaaagtgca 


3300 


gatgcttccc 


tggatacctg 


agtgtggaag 


aacggccctg 


tcagagtctc 


tgtgacctac 


3360 


agcctgactt 


ctgcttgaat 


gatggaaagt 


gtgacattat 


gcctgggcac 


ggggccattt 


3420 


gtaggtgccg 


ggtgggtgag 


aactggtggt 


accgaggcaa 


gcactgtgag 


gaatttgtgt 


3480 


ctgagcccgt 


gatcataggc 


atcactattg 


cctccgtggt 


tggacttctt 


gtcatctttt 


3540 


ctgctatcat 


ctacttcttc 


atcaggactc 


ttcaagcaca 


ccatgacagg 


agtgaaagag 


3600 


agagtccctt 


cagtggctcc 


agcaggcagc 


ctgacagcct 


ctcatctatt 


gagaatgctg 


3660 


tgaagtacaa 


ccccgtgtat 


gaaagtcaca 


gggctggatg 


tgagaagtat 


gagggaccct 


3720 


atcctcagca 


tcccttctac 


agctctgcta 


gcggagacgt 


gattggtggg 


ctgagcagag 


3780 


aagaaatcag 


acagatgtat 


gagagcagtg 


agctttccag 


agaggaaatt 


caagagagaa 


3840 


tgagagtttt 


ggaactgtat 


gccaatgatc 


ctgagtttgc 


agcttttgtg 


agagagcaac 


3900 


aagtggaaga 


ggtttaacca 


aaactcctgt 


tctgaaactg 


attagaagcc 


tggagaagat 


3960 


ggagattact 


tgttacttat 


gtcatataat 


taacctggat 


tttaaacact 


gttggaagaa 


4020 


gagttttcta 


tgaaaaaatt 


aaatataggg 


cacactgttt 


ttttttcagc 


ttaagntttc 


4080 


agaatgtagt 


nagagatgtw 


mcatttttat 


ttctataaag 


actgaatgct 


gtgtttaaat 


4140 


aattgaaaac 


tacgttaaaa 


aaaaa 








4165 



<210> 10 

<211> 1237 

<212> DNA 

<213?> Homo Sapiens 



<400> 10 
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gaattcggca 


cgaggcctcg 


tgccggggag caacctcagc ttctagtatc cagactccag 


60 


cgccgccccg 


ggcgcggacc 


ccaaccccga cccagagctt ctccagcggc ggcgcagcga 


120 


gcagggctcc 


ccgccttaac 


ttcctccgcg gggcccagcc accttcggga gtccgggttg 


180 


cccacctgca 


aactctccgc 


cttctgcacc tgccacccct gagccagcgc gggcgcccga 


240 


gcgagtcatg 


gccaacgcgg 


ggctgcagct gttgggcttc attctcgcct tcctgggatg 


300 


gatcggcgcc 


atcgtcagca 


ctgccctgcc ccagtggagg atttactcct atgccggcga 


360 


caacatcgtg 


accgcccagg 


ccatgtacga ggggctgtgg atgtcctgcg tgtcgcagag 


420 


caccgggcag 


atccagtgca 


aagtctttga ctccttgctg aatctgagca gcacattgca 


480 


agcaacccgt 


gccttgatgg 


tggttggcat cctcctggga gtgatagcaa tctttgtggc 


540 


caccgttggc 


atgaagtgta 


tgaagtgctt ggaagacgat gaggtgcaga agatgaggat 


600 


ggctgtcatt 


gggggcgcga 


tatttcttct tgcaggtctg gctattttag ttgccacagc 


660 


atggtatggc 


aatagaatcg 


ttcaagaatt ctatgaccct atgaccccag tcaatgccag 


72 0 


gtacgaattt 


ggtcaggctc 


tcttcactgg ctgggctgct gcttctctct gccttctggg 


780 


aggtgcccta 


ctttgctgtt 


cctgtccccg aaaaacaacc tcttacccaa caccaaggcc 


840 


ctatccaaaa 


cctgcacctt 


ccagcgggaa agactacgtg tgacacagag gcaaaaggag 


900 


aaaatcatgt 


tgaaacaaac 


cgaaaatgga cattgagata ctatcattaa cattaggacc 


960 


ttagaatttt 


gggtattgta 


atctaaagta tgttattaca aaacaaacaa acaaacaaaa 


1020 


aacccatgtg 


ttaaaatact 


cagtgctaaa catggcttaa tcttatttta tcttctttcc 


1080 


tcaatatagg 


agggaagatt 


tttccatttg tattactgct tcccattgag taatcatact 


1140 


caaatggggg 


aaggggtgct 


ccttaaatat atatagatat gtatatatac atgtttttct 


1200 


attaaaaata 


gccagtaaaa 


aaaaaaaaaa aaaaaaa 


1237 


<210> ll 

<211> 2010 

<212> DNA 

<213> Homo Sapiens 






<400> 11 
acagttgttg 


caaagtgctc 


agcactaagg gagccagcgc acagcacagc caggaaggcg 


60 


agcgagccca 


gccagcccag 


ccagcccagc cagcccggag gtcatttgat tgcccgcctc 


120 


agaacgatgg 


atctgcatct 


cttcgactac tcagagccag ggaacttctc ggacatcagc 


180 


tggccatgca acagcagcga 


ctgcatcgtg gtggacacgg tgatgtgtcc caacatgccc 


240 


aacaaaagcg 


tcctgctcta 


cacgctctcc ttcatttaca ttttcatctt cgtcatcggc 


300 


atgattgcca actccgtggt 


ggtctgggtg aatatccagg ccaagaccac aggctatgac 


360 
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acgcactgct acatcttgaa cctggccatt gccgacctgt gggttgtcct caccatccca 420 

gtctgggtgg tcagtctcgt gcagcacaac cagtggccca tgggcgagct cacgtgcaaa 480 

gtcacacacc tcatcttctc catcaacctc ttcggcagca ttttcttcct cacgtgcatg 54 o 

agcgtggacc gctacctctc catcacctac ttcaccaaca cccccagcag caggaagaag 600 

atggtacgcc gtgtcgtctg catcctggtg tggctgctgg ccttctgcgt gtctctgcct 660 

gacacctact acctgaagac cgtcacgtct gcgtccaaca atgagaccta ctgccggtcc 720 

ttctaccccg agcacagcat caaggagtgg ctgatcggca tggagctggt ctccgttgtc 780 

ttgggctttg ccgttccctt ctccattatc gctgtcttct acttcctgct ggccagagcc 840 

atctcggcgt ccagtgacca ggagaagcac agcagccgga agatcatctt ctcctacgtg 900 

gtggtcttcc ttgtctgctg gctgccctac cacgtggcgg tgctgctgga catcttctcc 960 

atcctgcact acatcccttt cacctgccgg ctggagcacg ccctcttcac ggccctgcat 10 20 

gtcacacagt gcctgtcgct ggtgcactgc tgcgtcaacc ctgtcctcta cagcttcatc 1080 

aatcgcaact acaggtacga gctgatgaag gccttcatct tcaagtactc ggccaaaaca U40 

gggctcacca agctcatcga tgcctccaga gtctcagaga cggagtactc tgccttggag 1200 

cagagcacca aatgatctgc cotggagagg ctctgggacg ggtttacttg tttttgaaca 1260 

9ggtgatggg ccctatggtt ttctagagca aagcaaagta gcttcgggtc ttgatgcttg 1320 

agtagagtga agaggggagc acgtgccccc tgcatcoatt ctctctttct cttgatgacg i 38 0 

cagctgtcat ttggctgtgc gtgctgacag ttttgcaaca ggcagagctg tgtcgcacag 1440 

cagtgctgtg cgtcagagcc agctgaggac aggcttgcct ggacttctgt aagataggat i 500 

tttctgtgtt tcctgaattt tttatatggt gatttgtatt taaattttaa gactttattt i 56 0 

tctcactatt ggtgtaoctt ataaatgtat ttgaaagtta aatatatttt aaatattgtt 1620 

tgggaggcat agtgctgaca tatattcaga gtgttgtagt tttaaggtta gcgtgacttc 1680 

agttttgact aaggatgaca ctaattgtta gctgttttga aattatatat atataaatat 1740 

atataaatat ataaatatat gccagtcttg gctgaaatgt tttatttacc atagttttat 1800 

atctgtgtgg tgttttgtac cggcacggga tatggaacga aaactgcttt gtaatgcagt i860 

ttgtgacatt aatagtattg taaagttaca ttttaaaata aacaaaaaac tgttctggac 1920 

tgcaaatctg cacacacaac gaacagttgc atttcagaga gttctctcaa tttgtaagtt 1980 
attttttttt aataaagatt tttgtttcct 

2010 

<210> 12 

<211> 7318 

<212> DNA 

<213> Homo Sapiens 
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<400? 12 

ctggctctta acggcgttta tgtcctttgc tgtctgaggg gcctcagctc tgaccaatct 60 

ggtcttcgtg tggtcattag catgggcttc gtgagacaga tacagctttt gctctggaag 120 

aactggaccc tgcggaaaag gcaaaagatt cgctttgtgg tggaactcgt gtggccttta 180 

tctttatttc tggtcttgat ctggttaagg aatgccaacc cgctctacag ccatcatgaa 240 

tgccatttcc ccaacaaggc gatgccctca gcaggaatgc tgccgtggct ccaggggatc 300 

ttctgcaatg tgaacaatcc ctgttttcaa agccccaccc caggagaatc tcctggaatt 360 

gtgtcaaact ataacaactc catcttggca agggtatatc gagattttca agaactcctc 420 

afcgaatgcac cagagagcca gcaccttggc cgtatttgga cagagctaca catcttgtcc 480 

caattcatgg acaccctccg gactcacccg gagagaattg caggaagagg aatacgaata 540 

agggatatct tgaaagatga agaaacactg acactatttc tcattaaaaa catcggcctg 600 

tctgactcag tggtctacct tctgatcaac tctcaagtcc gtccagagca gttcgctcat 660 

ggagtcccgg acctggcgct gaaggacatc gcctgcagcg aggccctcct ggagcgcttc 720 

atcatcttca gccagagacg cggggcaaag acggtgcgct atgccctgtg ctccctctcc 780 

cagggcaccc tacagtggat agaagacact ctgtatgcca acgtggactt cttcaagctc 840 

ttccgtgtgc ttcccacact cctagacagc cgttctcaag gtatcaatct gagatcttgg 900 

ggaggaatat tatctgatat gtcaccaaga attcaagagt ttatccatcg gccgagtatg 960 

caggacttgc tgtgggtgac caggcccctc atgcagaatg gtggtccaga gacctttaca 1020 

aagctgatgg gcatcctgtc tgacctcctg tgtggctacc ccgagggagg tggctctcgg 1080 

gtgctctcct tcaactggta tgaagacaat aactataagg cctttctggg gattgactcc 1140 

acaaggaagg atcctatcta ttcttatgac agaagaacaa catccttttg taatgcattg 1200 

atccagagcc tggagtcaaa tcctttaacc aaaatcgctt ggagggcggc aaagcctttg 1260 

ctgatgggaa aaatcctgta cactcctgat tcacctgcag cacgaaggat actgaagaat 1320 

gccaactcaa cttttgaaga actggaacac gttaggaagt tggtcaaagc ctgggaagaa 1380 

gtagggcccc agatctggta cttctttgac aacagcacac agatgaacat gatcagagat 1440 

accctgggga acccaacagt aaaagacttt ttgaataggc agcttggtga agaaggtatt 1500 

actgctgaag ccatcctaaa cttcctctac aagggccctc gggaaagcca ggctgacgac 1560 

atggccaact tcgactggag ggacatattt aacatcactg atcgcaccct ccgcctggtc 1620 

aatcaatacc tggagtgctt ggtcctggat aagtttgaaa gctacaatga tgaaactcag 1680 

ctcacccaac gtgccctctc tctactggag gaaaacatgt tctgggccgg agtggtattc 1740 

cctgacatgt atccctggac cagctctcta ccaccccacg tgaagtataa gatccgaatg 1800 
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gacatagacg tggtggagaa aaccaataag attaaagaca ggtattggga ttctggtccc i860 
agagctgatc ccgtggaaga tttccggtac atctggggcg ggtttgccta tctgcaggac 1920 
atggttgaac aggggatcac aaggagccag gtgcaggcgg aggctccagt tggaatctac 1980 
ctccagcaga- tgccctaccc ctgcttcgtg gacgattctt tcatgatcat cctgaaccgc 2040 
tgtttcccta tcttcatggt gctggcatgg atctactctg tctccatgac tgtgaagagc 2100 
atcgtcttgg agaaggagtt gcgactgaag gagaccttga aaaatcaggg tgtctccaat 2160 
gcagtgattt ggtgtacctg gttcctggac agcttctcca tcatgtcgat gagcatcttc 2220 
ctcctgacga tattcatcat gcatggaaga atcctacatt acagcgaccc attcatcctc 2280 
ttcctgttct tgttggcttt ctccactgcc accatcatgc tgtgctttct gotcagcacc 2340 
ttcttctcca aggccagtct ggcagcagcc tgtagtggtg tcatctattt caccctctac 2400 
ctgccacaca tcctgtgctt cgcctggcag gaccgcatga ccgctgagct gaagaaggct 2460 
gtgagcttac tgtctccggt ggcatttgga tttggcactg agtacctggt tcgctttgaa 2520 
gagcaaggcc tggggctgca gtggagcaac atcgggaaca gtcccacgga aggggacgaa 2580 
ttcagcttcc tgctgtccat gcagatgatg ctccttgatg ctgcgtgcta tggcttactc 2640 
gcttggtacc ttgatcaggt gtttccagga gactatggaa ccccacttcc ttggtacttt 2700 

cttctacaag.agtcgtattg gcttagcggt gaagggtgtt caaccagaga agaaagagcc 2760 

ctggaaaaga ccgagcccct aacagaggaa acggaggatc cagagcaccc agaaggaata 2820 

cacgactcct tctttgaacg tgagcatcca gggtgggttc ctggggtatg cgtgaagaat 2880 

ctggtaaaga tttttgagcc ctgtggccgg ccagctgtgg accgtctgaa catcaccttc 2 940 

tacgagaacc agatcaccgc attcctgggc cacaatggag ctgggaaaac caccaccttg 3000 

tccatcctga cgggtctgtt gccaccaacc tctgggaetg tgctcgttgg gggaagggac 3060 

attgaaacca gcctggatgc agtccggcag agccttggca tgtgtccaca gcaoaacatc 3120 

ctgttccacc acctcacggt ggctgagcac atgctgttct atgcccagct gaaaggaaag 3180 

tcccaggagg aggcccagct ggagatggaa gccatgttgg aggacacagg cctccaccac 3240 

aagcggaatg aagaggctca ggacctatca ggtggcatgc agagaaagct gtcggttgcc 3300 

attgcctttg tgggagatgc caaggtggtg attctggacg aacccacctc tggggtggac 3360 

ccttactcga gacgctcaat ctgggatctg ctcctgaagt atcgctcagg cagaaccatc 3420 

atcatgccca ctcaccacat ggacgaggcc gaccaccaag gggaccgcat tgccatcatt 3480 

gcccagggaa ggctctactg ctcaggcacc ccactcttcc tgaagaactg ctttggcaca 35 4 o 

ggcttgtact taaccttggt gcgcaagatg aaaaacatcc agagccaaag gaaaggcagt 3600 
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gaggggacct 


gcagctgctc 


gtctaagggt 


ttctccacca 


cgtgtccagc 


ccacgtcgat 


3660 


gacctaactc 


cagaacaagt 


cctggatggg 


gatgtaaatg 


agctgatgga 


tgtagttctc 


3720 


caccatgttc 


cagaggcaaa 


gctggtggag 


tgcattggtc 


aagaacttat 


cttccttctt 


3780 


ccaaataaga 


acttcaagca 


cagagcatat 


gccagccttt 


tcagagagct 


ggaggagacg 


3840 


ctggctgacc 


ttggtctcag 


cagttttgga 


atttctgaca 


ctcccctgga 


agagattttt 


3900 


ctgaaggtca 


cggaggattc 


tgattcagga 


cctctgtttg 


cgggtggcgc 


tcagcagaaa 


3960 


agagaaaacg 


tcaacccccg 


acacccctgc 


ttgggtccca 


gagagaaggc 


tggacagaca 


4020 


ccccaggact 


ccaatgtctg 


ctccccaggg 


gcgccggctg 


ctcacccaga 


gggccagcct 


4080 


cccccagagc 


cagagtgccc 


aggcccgcag 


ctcaacacgg 


ggacacagct 


ggtcctccag 


4140 


catgtgcagg 


cgctgctggt 


caagagattc 


caacacacca 


tccgcagcca 


caaggacttc 


4200 


ctggcgcaga 


tcgtgctccc 


ggctaccttt 


gtgtttttgg 


ctctgatgct 


ttctattgtt 


4260 


atccttcctt 


ttggcgaata 


ccccgctttg 


acccttcacc 


cctggatata 


tgggcagcag 


4320 


tacaccttct 


tcagcatgga 


tgaaccaggc 


agtgagcagt 


tcacggtact 


tgcagacgtc 


4380 


ctcctgaata 


agccaggctt 


tggcaaccgc 


tgcctgaagg 


aagggtggct 


tccggagtac 


4440 


ccctgtggca 


actcaacacc 


ctggaagact 


ccttctgtgt 


ccccaaacat 


cacccagctg 


4500 


ttccagaagc 


agaaatggac 


acaggtcaac 


ccttcaccat 


cctgcaggtg 


cagcaccagg 


4560 


gagaagctca 


ccatgctgcc 


agagtgcccc 


gagggtgccg 


ggggcctccc 


gcccccccag 


4620 


agaacacagc 


gcagcacgga 


aattctacaa 


gacctgacgg 


acaggaacat 


ctccgacttc 


4680 


ttggtaaaaa 


cgtatcctgc 


tcttataaga 


agcagcttaa 


agagcaaatt 


ctgggtcaat 


4740 


gaacagaggt 


atggaggaat 


ttccattgga 


ggaaagctcc 


cagtcgtccc 


catcacgggg 


4800 


gaagcacttg 


ttgggttttt 


aagcgacctt 


ggccggatca 


tgaatgtgag 


cgggggccct 


4860 


atcactagag 


aggcctctaa 


agaaatacct 


gatttcctta 


aacatctaga 


aactgaagac 


4920 


aacattaagg 


tgtggtttaa 


taacaaaggc 


tggcatgccc 


tggtcagctt 


tctcaatgtg 


4980 


gcccacaacg 


ccatcttacg 


ggccagcctg 


cctaaggaca 


ggagccccga 


ggagtatgga 


5040 


atcaccgtca 


ttagccaacc 


cctgaacctg 


accaaggagc 


agctctcaga 


gattacagtg 


5100 


ctgaccactt 


cagtggatgc 


tgtggttgcc 


atctgcgtga 


ttttctccat 


gtccttcgtc 


5160 


ccagccagct 


ttgtccttta 


tttgatccag 


gagcgggtga 


acaaatccaa 


gcacctccag 


5220 


tttatcagtg 


gagtgagccc 


caccacctac 


tgggtgacca 


acttcctctg 


ggacatcatg 


5280 


aattattccg 


tgagtgctgg 


gctggtggtg 


ggcatcttca 


tcgggtttca 


gaagaaagcc 


5340 


tacacttctc 


cagaaaacct 


tcctgccctt 


gtggcactgc 


tcctgctgta 


tggatgggcg 


5400 


gtcattccca 


tgatgtaccc 


agcatccttc 


ctgtttgatg 


tccccagcac 


agcctatgtg 


5460 
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gctttatctt gtgctaatct gttcatcggc atcaacagca gtgctattac cttcatcttg 5520 

gaattatttg ataataaccg gacgctgctc aggttcaacg ccgtgctgag gaagctgctc 5580 

attgtcttcc cccacttctg cctgggccgg ggcctcattg accttgcact gagccaggct 5640 

gtgaCagatg tctatgcccg gtttggtgag gagcactctg caaatccgtt ccactgggac 5700 

ctgattggga agaacctgtt tgccatggtg gtggaagggg tggtgtactt cctcctgacc 5760 

ctgctggtcc agcgccactt cttcctctcc caatggattg ccgagcccac taaggagccc 5820 

attgttgatg aagatgatga tgtggctgaa gaaagacaaa gaattattac tggtggaaat 5880 

aaaactgaca tcttaaggct acatgaacta accaagattt atctgggcac ctccagccca 5940 

gcagtggaca ggctgtgtgt cggagttcgc cctggagagt gctttggcct cctgggagtg 6000 

aatggtgccg gcaaaacaac cacattcaag atgctcactg gggacaccac agtgacctca 6060 

ggggatgcca ccgtagcagg caagagtatt ttaaccaata tttctgaagt ccatcaaaat 6120 

atgggctact gtcctcagtt tgatgcaatc gatgagctgc tcacaggacg agaacatctt 6180 

tacctttatg cccggcttcg aggtgtacca gcagaagaaa tcgaaaaggt tgcaaactgg 6240 

agtattaaga gcctgggcct gactgtctac gccgactgcc tggctggcac gtacagtggg 63 00 

ggcaacaagc ggaaactctc cacagccatc gcactcattg gctgcccacc gctggtgctg 6360 

ctggatgagc ccaccacagg gatggacccc caggcacgcc gcatgctgtg gaacgtcatc 6420 

gtgagcatca tcagaaaagg gagggctgtg gtcctcacat cccacagcat ggaagaatgt 6480 

gaggcactgt gtacccggct ggccatcatg gtaaagggcg cctttcgatg tatgggcacc 654 0 

attcagcatc tcaagtccaa atttggagat ggctatatcg tcacaatgaa gatcaaatcc 6600 

ccgaaggacg acctgcttcc tgacctgaac cctgtggagc agttcttcca ggggaacttc 6660 

ccaggcagtg tgcagaggga gaggcactac aacatgctcc agttccaggt ctcctcctcc 6720 

tccctggcga ggatcttcca gctcctcctc tcccacaagg acagcctgct catcgaggag 6780 

tactcagtca cacagaccac actggaccag gtgtttgtaa attttgctaa acagcagact 6840 

gaaagtcatg acctccctct gcaccctcga gctgctggag ccagtcgaca agcccaggac 6900 

tgatctttca caccgctcgt tcctgcagcc agaaaggaac tctgggcagc tggaggcgca 6960 

ggagcctgtg cccatatggt catccaaatg gactggccca gcgtaaatga ccccactgca 7020 

gcagaaaaca aacacacgag gagcatgcag cgaattcaga aagaggtctt tcagaaggaa 7080 

accgaaactg acttgctcac ctggaacacc tgatggtgaa accaaacaaa tacaaaatcc 7140 

ttctccagac cccagaacta gaaaccccgg gccatcccac tagcagcttt ggcctccata 7200 

ttgctctcat ttcaagcaga tctgcttttc tgcatgtttg tctgtgtgtc tgcgttgtgt 7260 
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gtgattttca tggaaaaata aaatgcaaat gcactcatca caaaaaaaaa aaaaaaaa 7318 

<210> 13 

<211> 2663 

<212> DNA 

<213> Homo Sapiens 

<400> 13 

ggcacgaggc tggtgtttag caactccgac cacctgcctg ctgaggggct agagccctca 60 

gcccagaccc tgtgcccccg gccgggctct catgcgtgga atggtgctgt gccccttgcc 120 

agcaggccag gctcaccatg gtgccgcatg ccatcttggc acgggggagg gacgtgtgca 180 

ggcggaatgg actcctcatc ctgtctgtgc tgtctgtcat cgtgggctgc ctcctcggct 240 

tcttcttgag gacccggcgc ctctcaccac aggaaattag ttacttccag ttccctggag 300 

agctcctgat gaggatgctg aagatgatga tcctgccact ggtggtctcc agcttgatgt 360 

ccggacttgc ctccctggat gccaagacct ctagccgcct gggcgtcctc accgtggcgt 420 

actacctgtg gaccaccttc atggctgtca tcgtgggcat cttcatggtc tccatcatcc 480 

acccaggcag cgcggcccag aaggagacca cggagcagag tgggaagccc atcatgagct 540 

cagccgatgc cctgttggac ctcatccgga acatgttccc agccaaccta gtagaagcca 600 

cattcaaaca gtaccgcacc aagaccaccc cagttgtcaa gtcccccaag gtggcaccag 660 

aggaggcccc tcctcggcgg atcctcatct acggggtcca ggaggagaat ggctcccatg 720 

tgcagaactt cgccctggac ctgaccccgc cgcccgaggt cgtttacaag tcagagccgg 780 

gcaccagcga tggcatgaat gtgctgggca tcgtcttctt ctctgccacc atgggcatca 840 

tgctgggccg catgggtgac agcggggccc ccctggtcag cttctgccag tgcctcaatg 900 

agtcggtcat gaagatcgtg gcggtggctg tgtggtattt ccccttcggc attgtgttcc 960 

tcattgcggg taagatcctg gagatggacg accccagggc cgtcggcaag aagctgggct 1020 

tctactcagt caccgtggtg tgcgggctgg tgctccacgg gctctttatc ctgcccctgc 1080 

tctacttctt catcaccaag aagaatccca tcgtcttcat ccgcggcatc ctgcaggctc 1140 

tgctcatcgc gctggccacc tcctccagct cagccacact gcccatcacc ttcaagtgcc 1200 

tgctggagaa caaccacatc gaccggcgca tcgctcgctt cgtgctgccc gtgggtgcca 1260 

ccatcaacat ggacggcact gcgctctacg aggctgtggc cgccatcttc atcgcccagg 1320 

tcaacaacta cgagctggac tttggccaga tcatcaccat cagtatcaca gccactgcag 1380 

ccagcattgg ggcagctggc atcccccagg ccggcctcgt caccatggtc atcgtgctca 1440 

cctccgtggg actgcccacc gatgacatca ccctcatcat tgccgttgac tgggctctgg 1500 

accgtttccg caccatgatt aacgtgctgg gtgatgcgct ggcagcgggg atcatggccc 1560 
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atatatgtcg gaaggatttt gcccgggaca caggcaccga gaaactgctg ccctgcgaga 1620 

ccaagccagt gagcctccag gagatcgtgg cagcccagca gaatggctgt gtgaagagtg 1680 

tagccgaggc ctccgagctc accctgggcc ccacctgccc ccaccacgtc cccgttcaag 1740 

tggagcggga tgaggagctg cccgctgcga gtctgaacca ctgcaccatc cagatcagcg 1800 

agctggagac caatgtctga gcctgcggag ctgcaggggc aggcgaggcc tccaggggca 1860 

gggtcctgag gcaggaactc gactctccaa ccctcctgag cagccggcag gggccaggat 1920 

cacacattct tctcaccctt gagaggctgg aattaacccc gcttgacgga aaatgtatct 1980 

cagagaaggg aaaggctgca tgggggagcc ccatccaggg agtgatgggc ccggcattgc 2040 

ctgaggcccc gctgtgacag tttccccggt gtgagcccgg tgagggcggc aggcaggggt 210 0 

tatccggccc cactttctgg atgacagact tgaggctctg agagctgaaa acacttgtcc 2160 

aaggtctcac gttaaggtca agacactaac tcaaatcttt caagccccgc ctctcctctt 2220 

ggaggacagg gcagcctgca gctgtgtcca ggcccaggcc ccaccccata acaggtggcc 2280 

tcagccacac agttctcccc aaggggagca gcccagggcc aagccccgct gccttcccca 234 0 

ggccacagtg cgtccagtct cctgtcctgc cacgtgtctt ttgcaaagct ccttggatgt 2400 

ggagacagat gtctttacta gagctgaaag gcccccttga cacatccagg ccaacctccc 2460 

atggaatagg taggcaagcc aggactccgg gaaggaggtg cagccaggat gctctggtgg 2520 

agctgccgat ggggccctgg tgtcagaact ccccaaaggc ctgtgcgtcc aagtggagtc 2580 

aggttttcta ttcctttctg tgtttgcaaa ttcagtgtta actaaataaa ggtattttgt 2640 

ttttcaaaaa aaaaaaaaaa aaa 2663 

<210> 14 

<211> 2993 

<212> DNA 

<213> Homo Sapiens 

<400> 14 

gggcctgcag ttggcagaag ggtcccgggc ccagagccag cggggccgtg ctgagacggc 60 

gtacgtgccc tgcgtgagtg cgtggcggcg gcgcgtgcgc taggggagtg ggcggtgagg 120 

cctggtccac gtgcgtccct tcccgggacc cccgcagctt ggcgcccagc ggctacgtga 180 

gccaaggcac ccggatgtcc gcgcccctct ccgagtgaca agtcccggcc tccggtcccg 240 

cagtgcccgc agcctcggcc ggcgtccacg cattgccatg gtgactgtgg gcaactactg 300 

cgaggccgaa gggcccgtgg gtccggcctg gatgcaggat ggcctgagtc cctgcttctt 360 

cttcacgctc gtgccctcga cgcggatggc tctagggact ctggccttgg tgctggctct 420 

tccctgcaga cgccgggagc ggcccgctgg tgctgattcg ctgtcttggg gggccggccc 480 
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tcgcatctct ccctacgtgc 


tgcagctgct 


tctggccaca 


cttcaggcgg 


cgctgcccct 


540 


ggccggcctg gctggccggg 


tgggcactgc 


ccggggggcc 


ccactgccaa 


gctatctact 


600 


tctggcctcc gtgctggaga 


gtctggccgg 


cgcctgtggc 


ctgtggctgc 


ttgtcgtgga 


660 


gcggagccag gcacggcagc 


gtctggcaat 


gggcatctgg 


atcaagttca 


ggcacagccc 


720 


tggtctcctg ctcctctgga 


ctgtggcgtt 


tgcagctgag 


aacttggccc 


tggtgtcttg 


780 


gaacagccca cagtggtggt 


gggcaagggc 


agacttgggc 


caacaggttc 


agtttagcct 


84 0 


gtgggtgctg cggtatgtgg 


tctctggagg 


gctgtttgtc 


ctgggtctct 


gggcccctgg 


900 


acttcgtccc cagtcctata 


cattgcaggt 


tcatgaagag 


gaccaagatg 


tggaaaggag 


960 


ccaggttcgg tcagcagccc 


aacagtctac 


ctggcgagat 


tttggcagga 


agctccgcct 


1020 


cctgagtggc tacctgtggc 


ctcgagggag 


tccagctctg 


cagctggtgg 


tgctcatctg 


1080 


cctggggctc atgggtttgg 


aacgggcact 


caatgtgttg 


gtgcctatat 


tctataggaa 


1140 


cattgtgaac ttgctgactg 


agaaggcacc 


ttggaactct 


ctggcctgga 


ctgttaccag 


1200 


ttacgtcttc ctcaagttcc 


tccagggggg 


tggcactggc 


agtacaggct 


tcgtgagcaa 


1260 


cctgcgcacc ttcctgtgga 


tccgggtgca 


gcagttcacg 


tctcggcggg 


tggagctgct 


1320 


catcttctcc cacctgcacg 


agctctcact 


gcgctggcac 


ctggggcgcc 


gcacagggga 


1380 


ggtgctgcgg atcgcggatc 


ggggcacatc 


cagtgtcaca 


gggctgctca 


gctacctggt 


1440 


gttcaatgtc atccccacgc 


tggccgacat 


catcattggc 


atcatctact 


tcagcatgtt 


1500 


cttcaacgcc tggtttggcc 


tcattgtgtt 


cctgtgcatg 


agtctttacc 


tcaccctgac 


1560 


cattgtggtc actgagtgga 


gaaccaagtt 


tcgtcgtgct 


atgaacacac 


aggagaacgc 


1620 


tacccgggca cgagcagtgg 


actctctgct 


aaacttcgag 


acggtgaagt 


attacaacgc 


1680 


cgagagttac gaagtggaac 


gctatcgaga 


ggccatcatc 


aaatatcagg 


gtttggagtg 


1740 


gaagtcgagc gcttcactgg 


ttttactaaa 


tcagacccag 


aacctggtga 


ttgggctcgg 


1800 


gctcctcgcc ggctccctgc 


tttgcgcata 


ctttgtcact 


gagcagaagc 


tacaggttgg 


I860 


ggactatgtg ctctttggca 


cctacattat 


ccagctgtac 


atgcccctca 


attggtttgg 


1920 


cacctactac aggatgatcc 


agaccaactt 


cattgacatg 


gagaacatgt 


ttgacttgct 


1980 


gaaagaggag acagaagtga 


aggaccttcc 


tggagcaggg 


ccccttcgct 


ttcagaaggg 


2040 


ccgtattgag tttgagaacg 


tgcacttcag 


ctatgccgat 


gggcgggaga 


ctctgcagga 


2100 


cgtgtctttc actgtgatgc 


ctggacagac 


acttgccctg 


gtgggcccat 


ctggggcagg 


2160 


gaagagcaca attttgcgcc 


tgctgtttcg 


cttctacgac 


atcagctctg 


gctgcatccg 


2220 


aatagatggg caggacattt 


cacaggtgac 


ccaggcctct 


ctccggtctc 


acattggagt 


2280 


tgtgccccaa gacactgtcc 


tctttaatga 


caccatcgcc 


gacaatatcc 


gttacggccg 


2340 
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tgtcacagct gggaatgatg aggtggaggc tgctgctcag gctgcaggca tccatgatgc 2400 

cattatggct ttccctgaag ggtacaggac acaggtgggc gagcggggac tgaagctgag 2460 

cggcggggag aagcagcgcg tcgccattgc ccgcaccatc ctcaaggctc cgggcatcat 252 0 

tctgctggat gaggcaacgt cagcgctgga tacatctaat gagagggcca tccaggcttc 2580 

tctggccaaa gtctgtgcca accgcaccac catcgtagtg gcacacaggc tctcaactgt 2 640 

ggtcaatgct gaccagatcc tcgtcatcaa ggatggctgc atcgtggaga ggggacgaca 2700 

cgaggctctg ttgtcccgag gtggggtgta tgctgacatg tggcagctgc agcagggaca 2760 

ggaagaaacc tctgaagaca ctaagcctca gaccatggaa cggtgacaaa agtttggcca 2820 

cttccctctc aaagactaac ccagaaggga ataagatgtg tctcctttcc ctggcttatt 2880 

tcatcctggt cttggggtat ggtgctagct atggtaaggg aaagggacct ttccgaaaaa 2940 

catcttttgg ggaaataaaa atgtggactg tgaaaaaaaa aaaaaaaaaa aaa 2993 



<210> 15 

<211> 1733 

<212> DNA 

<213> Homo Sapiens 




<400> 15 
gggcaatttg 


ttagttatcc gccgccacca agacgcggca cggcgcctgg accggagggg 


60 


ccccgcgcgg 


gcgcgaactt tgggctcggg cgagtgggtg gtgctccgcc cagcccgaga 


120 


cgggcgggcg 


cgcgggccaa tgggtgccgc ctcttggccg cggggggccc cgacccgtgg 


180 


gtcccggcca 


ccagcgcccc agccccgagg ctcagaagcg gcaggcggag gcgcggtccg 


240 


ggcgctatgg 


ccatgcccgg cgggtctcac gcggctgccc ctcgcccggc gcgccttcgg 


300 


tagggggcgc 


ccggggccca gctggcccgg ccatgctgct ggagacacag gacgcgctgt 


360 


acgtggcgct 


ggagctggtc atcgccgcgc tttcggtggc gggcaacgtg ctggtgtgcg 


420 


ccgcggtggg 


cacggcgaac actctgcaga cgcccaccaa ctacttcctg gtgtccctgg 


480 


ctgcggccga 


cgtggccgtg gggctcttcg ccatcccctt tgccatcacc atcagcctgg 


540 


gcttctgcac 


tgacttctac ggctgcctct tcctcgcctg cttcgtgctg gtgctcacgc 


600 


agagctccat 


cttcagcctt ctggccgtgg cagtcgacag atacctggcc atctgtgtcc 


660 


cgctcaggta 


taaaagtttg gtcacgggga cccgagcaag aggggtcatt gctgtccfcct 


720 


gggtccttgc 


ctttggcatc ggattgactc cattcctggg gtggaacagt aaagacagtg 


780 


ccaccaacaa 


ctgcacagaa ccctgggatg gaaccacgaa tgaaagctgc tgccttgtga 


840 


agtgtctctt 


tgagaatgtg gtccccatga gctacatggt atatttcaat ttctttgggt 


900 


gtgttctgcc 


cccactgctt ataatgotgg tgatctacat taagatcttc ctggtggcct 


960 
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gcaggcagct 


tcagcgcact 


gagctgatgg 


accactcgag 


gaccaccctc 


cagcgggaga 


1020 


tccatgcagc 


caagtcactg 


gccatgattg 


tggggatttt 


tgccctgtgc 


tggttacctg 


1080 


tgcatgctgt 


taactgtgtc 


actcttttcc 


agccagctca 


gggtaaaaat 


aagcccaagt 


1140 


gggcaatgaa 


tatggccatt 


cttctgtcac 


atgccaattc 


agttgtcaat 


cccattgtct 


1200 


atgcttaccg 


gaaccgagac 


ttccgctaca 


cttttcacaa 


aattatctcc 


aggtatcttc 


1260 


tctgccaagc 


agatgtcaag 


agtgggaatg 


gtcaggctgg 


ggtacagcct 


gctctcggtg 


1320 


tgggcctatg 


atctaggctc 


tcgcctcttc 


caggagaaga 


tacaaatcca 


caagaaacaa 


1380 


agaggacacg 


gctggttttc 


attgtgaaag 


atagctacac 


ctcacaagga 


aatggactgc 


1440 


ctctcttgag 


cacttccctg 


gagctaccac 


gtatctagct 


aatatgtatg 


tgtcagtagt 


1500 


aggctccaag 


gattgacaaa 


tatatttatg 


atctattcag 


ctgcttttac 


tgtgtggatt 


1560 


atgccaacag 


cttgaatgga 


ttctaacaga 


ctcttttgtt 


tttaaaagtc 


tgccttgttt 


1620 


atggtggaaa 


attactgaaa 


ctattttact 


gtgaaacagt 


gtgaactatt 


ataatgcaaa 


1680 


tactttttaa 


cttagaggca 


atggaaaaat 


aaaagttgac 


tgtactaaaa 


atg 


1733 


<210> 16 

<211> 3338 

<212> DNA 

<213> Homo Sapiens 












<400> 16 
cccagcccgg 


ccccgccgcc 


ccggctgcgc 


acgcgacgcc 


ccctccaggc 


cccgctcctg 


60 


cgccctattt 


ggtcattcgg 


ggggcaagcg 


gcgggagggg 


aaacgtgcgc 


ggccgaaggg 


120 


gaagcggagc 


cggcgccggc 


tgcgcagagg 


agccgctctc 


gccgccgcca 


cctcggctgg 


180 


gagcccacga 


ggctgccgca 


tcctgccctc 


ggaacaatgg 


gactcggcgc 


gcgaggtgct 


240 


tgggccgcgc 


tgctcctggg 


gacgctgcag 


gtgctagcgc 


tgctgggggc 


cgcccatgaa 


300 


agcgcagcca 


tggcggagac 


tctccaacat 


gtgccttctg 


accatacaaa 


tgaaacttcc 


360 


aacagtactg 


tgaaaccacc 


aacttcagtt 


gcctcagact 


ccagtaatac 


aacggtcacc 


420 


accatgaaac 


ctacagcggc 


atctaataca 


acaacaccag 


ggatggtctc 


aacaaatatg 


480 


acttctacca 


ccttaaagtc 


tacacccaaa 


acaacaagtg 


tttcacagaa 


cacatctcag 


540 


atatcaacat 


ccacaatgac 


cgtaacccac 


aatagttcag 


tgacatctgc 


tgcttcatca 


600 


gtaacaatca 


caacaactat 


gcattctgaa 


gcaaagaaag 


gatcaaaatt 


tgatactggg 


660 


agctttgttg 


gtggtattgt 


attaacgctg 


ggagttttat 


ctattcttta 


cattggatgc 


720 


aaaatgtatt 


actcaagaag 


aggcattcgg 


tatcgaacca 


tagatgaaca 


tgatgccatc 


780 


atttaaggaa 


atccatggac 


caaggatgga 


atacagattg 


atgctgccct 


atcaattaat 


840 
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tttggtttat 


taatagttta 


aaacaatatt ctctttttga aaatagtata aacaggccat 


900 


gcatataatg 


tacagtgtat 


tacgtaaata tgtaaagatt cttcaaggta acaagggttt 


960 


gggttttgaa 


ataaacatct 


ggatcttata gaccgttcat acaatggttt tagcaagttc 


1020 


atagtaagac 


aaacaagtcc 


tatctttttt tttttggctg gggtgggggc attggtcaca 


1080 


tatgaccagt 


aattgaaaga 


cgtcatcact gaaagacaga atgccatctg ggcatacaaa 


1140 


taagaagttt 


gtcacagcac 


tcaggatttt gggtatcttt tgtagctcac ataaagaact 


1200 


tcagtgcttt 


tcagagctgg 


atatatctta attactaatg ccacacagaa attatacaat 


1260 


caaactagat 


ctgaagcata 


atttaagaaa aacatcaaca ttttttgtgc tttaaactgt 


1320 


agtagttggt 


ctagaaacaa 


aatactccaa gaaaaagaaa attttcaaat aaaacccaaa 


1380 


ataatagctt 


tgcttagccc 


tgttagggat ccattggagc attaaggagc acatattttt 


1440 


attaacttct 


tttgagcttt 


caatgttgat gtaatttttg ttctctgtgt aatttaggta 


1500 


aactgcagtg 


tttaacataa 


taatgtttta aagacttagt tgtcagtatt aaataatcct 


1560 


ggcattatag 


ggaaaaaacc 


tcctagaagt tagattattt gctactgtga gaatattgtc 


1620 


accactggaa 


gttactttag 


ttcatttaat tttaatttta tattttgtga atattttaag 


1680 


aactgtagag 


ctgctttcaa 


tatctagaaa tttttaattg agtgtaaaca cacctaactt 


1740 


taagaaaaag 


aaccgcttgt 


atgattttca aaagaacatt tagaattcta tagagtcaaa 


1800 


actatagcgt 


aatgctgtgt 


ttattaagcc agggattgtg ggacttcccc caggcaacta 


1860 


aacctgcagg 


atgaaaatgc 


tatattttct ttcatgcact gtcgatatta ctcagatttg 


1920 


gggaaatgac 


atttttatac 


taaaacaaac accaaaatat tttagaataa attcttagaa 


1980 


agttttgaga 


ggaattttta 


gagaggacat ttcctccttc ctgatttgga tattccctca 


2040 


aatccctcct 


cttactccat 


gctgaaggag aagtactctc agatgcatta tgttaatgga 


2100 


gagaaaaagc 


acagtattgt 


agagacacca atattagcta atgtattttg gagtgttttc 


2160 


cattttacag 


tttatattcc 


agcactcaaa actcagggtc aagttttaac aaaagaggta 


2220 


tgtagtcaca 


gtaaatacta 


agatggcatt tctatctcag agggccaaag tgaatcacac 


2280 


cagtttctga 


aggtcctaaa 


aatagctcag atgtcctaat gaacatgcac ctacatttaa 


2340 


taqqaqtaca 


ataaaactat 


L-yuLdytLLi LyLCLtdcdy agaacgccag acancaagaa 


2400 


ttttgaaatg 


gatcatttct 


acttgctgtg cattttaacc aataatctga tgaatataga 


2460 


aaaaaatgat 


ccaaaatatg 


gatatgattg gatgtatgta acacatacat ggagtatgga 


2520 


ggaaattfctc 


tgaaaaatac 


atttagatta gtttagtttg aaggagaggt gggctgatgg 


2580 


ctgagttgta 


tgttactaac 


ttggccctga ctggttgtgc aaccattgct tcatttcttt 


2640 
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gcaaaatgta 


gttaagatat actttattct aatgaaggcc ttttaaattt gtccactgca 


2700 


ttcttggtat 


ttcactactt caagtcagtc agaacttcgt agaccgacct gaagtttctt 


2760 


tttgaatact 


tgtttcttta gcactttgaa gatagaaaaa ccacttttta agtactaagt 


2820 


catcatttgc 


cttgaaagtt tcctctgcat tgggtttgaa gtagtttagt tatgtctttt 


2880 


tctctgtatg 


taagtagtat aatttgttac tttcaaatac ccgtactttg aatgtaggtt 


2940 


tttttgttgt 


tgttatctat aaaaattgag ggaaatggtt atgcaaaaaa atattttgct 


3000 


ttggaccata 


tttcttaagc ataaaaaaat gctcagtttt gcttgcattc cttgagaatg 


3060 


tatttatctg 


aagatcaaaa caaacaatcc agatgtataa gtactaggca gaagccaatt 


3120 


ttaaaatttc 


cttgaataat ccatgaaagg aataattcaa atacagataa acagagttgg 


3180 


cagtatatta 


tagtgataat tttgtatttt caamaaaaaa aaagttaaac tcttcttttc 


3240 


tttttattat 


aatgaccagc ttttggtatt tcattgttac caagttctat ttttagataa 


3300 


aattgttctc 


cttctaaaaa aaaaaaaaaa aaaaaaaa 


3338 


<210> 17 

<211> 1214 

<212> DNA 

<213> Homo Sapiens 




<400> 17 
gtctacaccc 


cctcctcaca cgcacttcac ctgggtcggg attctcaggt catgaacggt 


60 


cccagccacc 


tccgggcagg gcgggtgagg acggggacgg ggcgtgtcca actggctgtg 


.120 


ggctcttgaa 


acccgagcat ggcacagcac ggggcgatgg gcgcgtttcg ggccctgtgc 


180 


ggcctggcgc 


tgctgtgcgc gctcagcctg ggtcagcgcc ccaccggggg tcccgggtgc 


240 


ggccctgggc 


gcctcctgct tgggacggga acggacgcgc gctgctgccg ggttcacacg 


300 


acgcgctgct 


gccgcgatta cccgggcgag gagtgctgtt ccgagtggga ctgcatgtgt 


360 


gtccagcctg 


aattccactg cggagaccct tgctgcacga cctgccggca ccacccttgt 


420 


cccccaggcc 


agggggtaca gtcccagggg aaattcagtt ttggcttcca gtgtatcgac 


480 


tgtgcctcgg 


ggaccttctc cgggggccac gaaggccact gcaaaccttg gacagactgc 


540 


acccagttcg 


ggtttctcac tgtgttccct gggaacaaga cccacaacgc tgtgtgcgtc 


600 


ccagggtccc 


cgccggcaga gccgcttggg tggctgaccg tcgtcctcct ggccgtggcc 


660 


gcctgcgtcc 


tcctcctgac ctcggcccag cttggactgc acatctggca gctgaggagt 


720 


cagtgcatgt 


ggccccgaga gacccagctg ctgctggagg tgccgccgtc gaccgaagac 


780 


gccagaagct 


gccagttccc cgaggaagag cggggcgagc gatcggcaga ggagaagggg 


840 


cggctgggag 


acctgtgggt gtgagcctgg ccgtcctccg gggccaccga ccgcagccag 


900 
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cccctcccca ggagctcccc 


aggccgcagg 


ggctctgcgt 


tctgctctgg 


gccgggccct 


960 


gctcccctgg 


cagcagaagt 


gggtgcagga 


aggtggcagt 


gaccagcgcc 


ctggaccatg 


1020 


cagttcggcg gccgcggctg 


ggccctgcag 


gagggagaga 


gagacacagt 


catggccccc 


1080 


ttcctccctt 


gctggccctg 


atggggtggg 


gtcttaggac 


gggaggctgt 


gtccgtgggt 


1140 


gtgcagtgcc 


cagcacggga 


cccggctgca 


ggggaccttc 


aataaacact 


tgtccagtga 


1200 


aaaaaaaaaa 


aaaa 










1214 


<210> 18 

<211> 2322 

<212> DNA 

<213> Homo Sapiens 












<400> 18 
agatccgcga 


gcccgtcagc 


ctgcgccatg 


ggctgcgacg 


gccgcgtgtc 


ggggctgctc 


60 


cgccgcaacc 


tgcagcccac 


gctcacctac 


tggagcgtct 


tcttcagctt 


cggcctgtgc 


120 


atcgccttcc 


tggggcccac 


gctgctggac 


ctgcgctgtc 


agacgcacag 


ctcgctgccc 


180 


cagatctcct 


gggtcttctt 


ctcgcagcag 


ctctgcctcc 


tgctgggcag 


cgccctcggg 


240 


ggcgtcttca 


aaaggaccct 


ggcccagtca 


ctatgggccc 


tgttcacctc 


ctctctggcc 


300 


atctccctgg 


tgtttgccgt 


catccccttc 


tgccgcgacg 


tgaaggtgct 


ggcctcagtc 


360 


atggcgctgg 


cgggcttggc 


catgggctgc 


atcgacaccg 


tggccaacat 


gcagctggta 


420 


aggatgtacc 


agaaggactc 


ggccgtcttc 


ctccaggtgc 


tccatttctt 


cgtgggcttt 


480 


ggtgctctgc 


tgagccccct 


tattgctgac 


cctttcctgt 


ctgaggccaa 


ctgcttgcct 


540 


gccaatagca 


cggccaacac 


cacctcccga 


ggccacctgt 


tccatgtctc 


cagggtgctg 


600 


ggccagcacc 


acgtagatgc 


caagccttgg 


tccaaccaga 


cgttcccagg 


gctgactcca 


660 


aaggacgggg 


cagggacccg 


agtgtcctat 


gccttctgga 


tcatggccct 


catcgatctt 


720 


ccagtgccca 


tggctgtgct 


gatgctgctg 


tccaaggagc 


ggctgctgac 


ctgctgtccc 


780 


cagaggaggc 


ccctgcttct 


gtctgctgat 


gagcttgcct 


tggagacaca 


gcctcctgag 


840 


aaggaagatg 


cctcctcact 


gcccccaaag 


tttcagtcac 


acctagggca 


tgaggacctg 


900 


ttcagctgct 


gccaaaggaa 


gaacctcaga 


ggagcccctt 


attccttctt 


tgccatccac 


960 


atcacgggcg 


ccctggtact 


gttcatgacg 


gatgggttga 


cgggtgccta 


ttccgccttc 


1020 


gtgtacagct 


atgctgtgga 


gaagcccctg 


tctgtgggac 


acaaggtggc 


tggctacctc 


1080 


cccagcctct 


tctggggctt 


catcacactg 


ggccggctcc 


tctccattcc 


catatcctca 


1140 


agaatgaagc 


cggccaccat 


ggttttcatc 


aacgtggttg 


gcgtggtggt 


gacgttcctg 


1200 


gtgctgctta 


ttttctccta 


caacgtcgtc 


ttcctgttcg 


tggggacggc 


aagcctgggc 


1260 
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ctgtttctca 


gcagcacctt 


ccccagcatg ctggcctaca cggaggactc gctgcagtac 


1320 


aaaggctgtg 


caaccacagt 


gctggtgaca ggggcaggag ttggcgagat ggtgctgcag 


1380 


atgctggttg 


gttcgatatt 


ccaggctcag ggcagctata gtttcctggt ctgtggcgtg 


1440 


atctttggtt 


gtctggcttt 


taccttctat atcttgctcc tgtttttcca caggatgcac 


1500 


cctggactcc 


catcagttcc 


tacccaagac agatcaattg gaatggaaaa ctctgagtgc 


1560 


taccagaggt 


aaaactgggt 


gaagaaggca agagaagact ttcagcctct tgatcaccag 


1620 


cacgaccata 


ctgtttcaga 


aagctgggtg gtggtggagg cgctctctca atggctattc 


1680 


aagtcttctc 


cactaaaact 


tggttgggta gaggaaatta aattgagtcc tggtacctgg 


1740 


tcaaaatcat 


tagaagttta 


cctggcttct caagttatct tcttccctgg ttcagactgt 


1800 


tggtaagagc 


tgtccagata 


cccagatggg aaggaaggag acagccgcgc gcttcactcc 


1860 


atttgtcacc 


tcatgcatgg 


accatactct gggtttgaga tcattcttca ttgaagtttg 


1920 


taaaaatagg 


ttgaaattgt 


aaagctccat gatcactgct atatgtagat atatttcaat 


1980 


ttaagcaaaa 


caagctgcaa 


gttattccct ggcatgctca aaggattttc gtgcttttca 


2040 


cttaatagtc 


caaagtctct 


taaattcctg ctgcagatat caatagctta tctatattct 


2100 


UddaCdCCdd 


aaggaaaagt 


tgaatcttgc tctctttggt atactaatgt agtggtatgc 


2160 


taagctggct 


cataccaact 


tagaaaagct gattgtaaaa ttttcatttt gacagctggt 


2220 


tattaaatgc 


agccattatt 


aaaaatcaaa tcatacaaac ttataattaa atcaattaca 


2280 


tttaaaacaa 


aggtaataaa 


tattcaaagc atatcacttc ct 


2322 


<210> 19 

<211> 5361 

<212> DNA 

<213> Homo Sapiens 






<400> 19 
ctgcaaaccc 


agcgcaacta 


cggtcccccg gtcagaccca ggatggggcc agaacggaca 


60 


ggggccgcgc 


cgctgccgct 


gctgctggtg ttagcgctca gtcaaggcat tttaaattgt 


120 


tgtttggcct 


acaatgttgg 


tctcccagaa gcaaaaatat tttccggtcc ttcaagtgaa 


180 


cagtttgggt 


atgcagtgca 


gcagtttata aatccaaaag gcaactggtt actggttggt 


240 


tcaccctgga gtggctttcc 


tgagaaccga atgggagatg tgtataaatg tcctgttgac 


300 


ctatccactg ccacatgtga 


aaaactaaat ttgcaaactt caacaagcat tccaaatgtt 


360 


actgagatga aaaccaacat 


gagcctcggc ttgatcctca ccaggaacat gggaactgga 


420 


ggttttctca 


catgtggtcc 


tctgtgggca cagcaatgtg ggaatcagta ttacacaacg 


480 


ggtgtgtgtt 


ctgacatcag 


tcctgatttt cagctctcag ccagcttctc acctgcaact 


540 
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cagccctgcc cttccctcat agatgttgtg gttgtgtgtg atgaatcaaa tagtatttat 600 

ccttgggatg cagtaaagaa ttttttggaa aaatttgtac aaggccttga tataggcccc 660 

acaaagacac aggtggggtt aattcagtat gccaataatc caagagttgt gtttaacttg 720 

aacacatata aaaccaaaga agaaatgatt gtagcaacat cccagacatc ccaatatggt 780 

ggggacctca caaacacatt cggagcaatt caatatgcaa gaaaatatgc ctattcagca 840 

gcttctggtg ggcgacgaag tgctacgaaa gtaatggtag ttgtaactga cggtgaatca 900 

catgatggtt caatgttgaa agctgtgatt gatcaatgca accatgacaa tatactgagg 960 

tttggcatag cagttcttgg gtacttaaac agaaacgccc ttgatactaa aaatttaata 1020 

aaagaaataa aagcgatcgc tagtattcca acagaaagat actttttcaa tgtgtctgat 1080 

gaagcagctc tactagaaaa ggctgggaca ttaggagaac aaattttcag cattgaaggt 1140 

actgttcaag gaggagacaa ctttcagatg gaaatgtcac aagtgggatt cagtgcagat 1200 

tactcttctc aaaatgatat tctgatgctg ggtgcagtgg gagcttttgg ctggagtggg 1260 

accattgtcc agaagacatc tcatggccat ttgatctttc ctaaacaagc ctttgaccaa 1320 

attctgcagg acagaaatca cagttcatat ttaggttact ctgtggctgc aatttctact 1380 

ggagaaagca ctcactttgt tgctggtgct cctcgggcaa attataccgg ccagatagtg 1440 

ctatatagtg tgaatgagaa tggcaatatc acggttattc aggctcaccg aggtgaccag 1500 

attggctcct attttggtag tgtgctgtgt tcagttgatg tggataaaga caccattaca 1560 

gacgtgctct tggtaggtgc accaatgtac atgagtgacc taaagaaaga ggaaggaaga 1620 

gtctacctgt ttactatcaa aaagggcatt ttgggtcagc accaatttct tgaaggcccc 1680 

gagggcattg aaaacactcg atttggttca gcaattgcag ctctttcaga catcaacatg 1740 

gatggcttta atgatgtgat tgttggttca ccactagaaa atcagaattc tggagctgta 1800 

tacatttaca atggtcatca gggcactatc cgcacaaagt attcccagaa aatcttggga 1860 

tccgatggag cctttaggag ccatctccag tactttggga ggtccttgga tggctatgga 1920 

gatttaaatg gggattccat caccgatgtg tctattggtg cctttggaca agtggttcaa 1980 

ctctggtcac aaagtattgc tgatgtagct atagaagctt cattcacacc agaaaaaatc 2040 

actttggtca acaagaatgc tcagataatt ctcaaactct gcttcagtgc aaagtfccaga 2100 

cctactaagc aaaacaatca agtggccatt gtatataaca tcacacttga tgcagatgga 2160 

ttttcatcca gagtaacctc cagggggtta tttaaagaaa acaatgaaag gtgcctgcag 2220 

aagaatatgg tagtaaatca agcacagagt tgccccgagc acatcatfcta tatacaggag 2280 

ccctctgatg ttgtcaactc tttggatttg cgtgtggaca tcagtctgga aaaccctggc 2340 

actagccctg cccttgaagc ctattctgag actgccaagg tcttcagtat tcctttccac 2400 
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aaagactgtg gtgaggatgg actttgcatt tctgatctag tcctagatgt ccgacaaata 2460 

ccagctgctc aagaacaacc ctttattgtc agcaaccaaa acaaaaggtt aacattttca 2520 

gtaacactga aaaataaaag ggaaagtgca tacaacactg gaattgttgt tgatttttca 2580 

gaaaacttgt tttttgcatc attctcccta ccggttgatg ggacagaagt aacatgccag 2640 

gtggctgcat ctcagaagtc tgttgcctgc gatgtaggct accctgcttt aaagagagaa 2700 

caacaggtga cttttactat taactttgac ttcaatcttc aaaaccttca gaatcaggcg 2760 

tctctcagtt tccaagcctt aagtgaaagc caagaagaaa acaaggctga taatttggtc 2 820 

aacctcaaaa ttcctctcct gtatgatgct gaaattcact taacaagatc taccaacata 2880 

aatttttatg aaatctcttc ggatgggaat gttccttcaa tcgtgcacag ttttgaagat 2 940 

gttggtccaa aattcatctt ctccctgaag gtaacaacag gaagtgttcc agtaagcatg 3000 

gcaactgtaa tcatccacat ccctcagtat accaaagaaa agaacccact gatgtaccta 3060 

actggggtgc aaacagacaa ggctggtgac atcagttgta atgcagatat caatccactg 3120 

aaaataggac aaacatcttc ttctgtatct ttcaaaagtg aaaatttcag gcacaccaaa 3180 

gaattgaact gcagaactgc ttcctgtagt aatgttacct gctggttgaa agacgttcac 3240 

atgaaaggag aatactttgt taatgtgact accagaattt ggaacgggac tttcgcatca 330 0 

tcaacgttcc agacagtaca gctaacggca gctgcagaaa tcaacaccta taaccctgag 3360 

atatatgtga ttgaagataa cactgttacg attcccctga tgataatgaa acctgatgag 3420 

aaagccgaag taccaacagg agttataata ggaagtataa ttgctggaat ccttttgctg 3480 

ttagctctgg ttgcaatttt atggaag.ctc ggcttcttca aaagaaaata tgaaaagatg 3540 

accaaaaatc cagatgagat tgatgagacc acagagctca gtagctgaac cagcagacct 3600 

acctgcagtg ggaaccggca gcatcccagc cagggtttgc tgtttgcgtg catggatttc 3660 

tttttaaatc ccatattttt tttatcatgt cgtaggtaaa ctaacctggt attttaagag 3720 

aaaactgcag gtcagtttgg atgaagaaat tgtggggggt gggggaggtg cggggggcag 3780 

gtagggaaat aatagggaaa atacctattt tatatgatgg gggaaaaaaa gtaatcttta 3840 

aactggctgg cccagagttt acattctaat ttgcattgtg tcagaaacat gaaatgcttc 3900 

caagcatgac aacttttaaa gaaaaatatg atactctcag attttaaggg ggaaaactgt 3960 

tctctttaaa atatttgtct ttaaacagca actacagaag tggaagtgct tgatatgtaa 4020 

gtacttccac ttgtgtatat tttaatgaat attgatgtta acaagagggg aaaacaaaac 4080 

acaggttttt tcaatttatg ctgctcatcc aaagttgcca cagatgatac ttccaagtga 4140 

taattttatt tataaactag gtaaaatttg ttgttggttc cttttatacc acggctgccc 42 00 
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cttccacacc 


ccatcttgct 


ctaatgatca 


aaacatgctt 


gaataactga 


gcttagagta 


4260 


tacctcctat 


atgtccattt 


aagttaggag 


agggggcgat 


atagagacta 


aggcacaaaa 


4320 


ttttgtttaa 


aactcagaat 


ataacattta 


tgtaaaatcc 


catctgctag 


aagcccatcc 


4380 


tgtgccagag 


gaaggaaaag 


gaggaaattt 


cctttctctt 


ttaggaggca 


caacagttct 


4440 


cttctaggat 


ttgtttggct 


gactggcagt 


aacctagtga 


atttttgaaa 


gatgagtaat 


4500 


ttctttggca 


accttcctcc 


tcccttactg 


aaccactctc 


ccacctcctg 


gtggtaccat 


4560 


tattatagaa 


gccctctaca 


gcctgacttt 


ctctccagcg 


gtccaaagtt 


atcccctcct 


4620 


ttacccctca 


tccaaagttc 


ccactccttc 


aggacagctg 


ctgtgcatta 


gatattaggg 


4680 


gggaaagtca 


tctgtttaat 


ttacacactt 


gcatgaatta 


ctgtatataa 


actccttaac 


4740 


ttcagggagc 


tattttcatt 


tagtgctaaa 


caagtaagaa 


aaataagcta 


gagtgaattt 


4800 


ctaaatgttg 


gaatgttatg 


ggatgtaaac 


aatgtaaagt 


aaaacactct 


caggatttca 


4860 


ccagaagtta 


cagatgaggc 


actggaaacc 


accaccaaat 


tagcaggtgc 


accttctgtg 


4920 


gctgtcttgt 


ttctgaagta 


ctttttcttc 


cacaagagtg 


aatttgacct 


aggcaagttt 


4980 


gttcaaaagg 


tagatcctga 


gatgatttgg 


tcagattggg 


ataaggccca 


gcaatctgca 


5040 


ttttaacaag 


caccccagtc 


actaggatgc 


agatggacca 


cactttgaga 


aacaccaccc 


5100 


atttctactt 


tttgcacctt 


attttctctg 


ttcctgagcc 


cccacattct 


ctaggagaaa 


5160 


cttagattaa 


aattcacaga 


cactacatat 


ctaaagcttt 


gacaagtcct 


tgacctctat 


5220 


aaacttcaga 


gtcctcatta 


taaaatggga 


agactgagct 


ggagttcagc 


agtgatgctt 


5280 


tttagtttta 


aaagtctatg 


atctgatctg 


gacttcctat 


aatacaaata 


cacaatcctc 


5340 


caagaatttg 


acttggaaaa 


g 








5361 


<210> 20 

<211> 1519 

<212> DNA 

<213> Homo Sapiens 












<400> 20 

agcaggcgtt tgcgagagga 


gatacgagct 


ggacgcctgg 


cccttccctc 


ccaccgggtc 


60 


ctagtccacc gctcccggcg 


ccggctcccc 


gcctctcccg 


ctatgtaccg 


accgcgagcc 


120 


cgggcggctc ccgagggcag 


ggtccggggc 


tgcgcggtgc 


ccagcaccgt 


gctcctgctg 


180 


ctcgcctacc tggcttacct 


ggcgctgggc 


accggcgtgt 


tctggacgct 


ggagggccgc 


240 


gcggcgcagg actccagccg 


cagcttccag 


cgcgacaagt 


gggagctgtt 


gcagaacttc 


300 


acgtgtctgg accgcccggc 


gctggactcg 


ctgatccggg 


atgtcgtcca 


agcatacaaa 


360 


aacggagcca gcctcctcag 


caacaccacc 


agcatggggc 


gctgggagct 


cgtgggctcc 


420 
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ttcttctttt 


ctgtgtccac 


catcaccacc 


attggctatg 


gcaacctgag ccccaacacg 


480 


atggctgccc 


gcctcttctg 


catcttcttt 


gcccttgtgg 


ggatcccact caacctcgtg 


540 


gtgctcaacc 


gactggggca 


tctcatgcag 


cagggagtaa 


accactgggc cagcaggctg 


600 


gggggcacct 


ggcaggatcc 


tgacaaggcg 


cggtggctgg 


cgggctctgg cgccctcctc 


660 


tcgggcctcc 


tgctcttcct 


gctgctgcca 


ccgctgctct 


tctcccacat ggagggctgg 


720 


agctacacag 


agggcttcta 


cttcgccttc 


atcaccctca 


gcaccgtggg cttcggcgac 


780 


tacgtgattg 


gaatgaaccc 


ctcccagagg 


tacccactgt 


ggtacaagaa catggtgtcc 


840 


ctgtggatcc 


tctttgggat 


ggcatggctg 


gccttgatca 


tcaaactcat cctctcccag 


900 


ctggagacgc 


cagggagggt 


atgttcctgc 


tgccaccaca 


gctctaagga agacttcaag 


960 


tcccaaagct 


ggagacaggg 


acctgaccgg 


gagccagagt 


cccactcccc acagcaagga 


1020 


tgctatccag 


agggacccat 


gggaatcata 


cagcatctgg 


aaccttctgc tcacgctgca 


1080 


ggctgtggca 


aggacagcta 


gttatactcc 


attctttggt 


cgtcgtcctc ggtagcaaga 


1140 


cccctgattt 


taagctttgc 


acatgtccac 


ccaaactaaa 


gactacattt tccatccacc 


1200 


ctagaggctg 


ggtgcagcta 


tatgattaat 


tctgcccaat 


agggtataca gagacatgtc 


1260 


ctgggtgaca 


tgggatgtga 


ctttcgggtg 


tc 999gcagc 


atgcccttct cccccacttc 


1320 


cttactttag 


cgggctgcaa 


tgccgccgat 


atgatggctg 


ggagctctgg cagccatacg 


1380 


gcaccatgaa 


gtagcggcaa 


tgtttgagcg 


gcacaattag 


ataggaagag tctggatctc 


1440 


tgatgatcac 


agagccatcc 


taacaaacgg 


aatatcaccg 


accctccttt atgtgagaga 


1500 


gaaataaaca 


tctatgaaa 








1519 


<210> 21 

<211> 1832 

<212> DNA 

<213> Homo Sapiens 










<400> 21 

aaggacagag gaggggccct 


tcctgtcagc 


tggctgggag 


cagaggtggc tfctgtctttt 


60 


cggaagaact ggttctgtgg 


aatttgtgct 


tatttcccat 


caaggatcaa ggacctgctc 


120 


tggggctacc tcagggcccc 


acaggatgag 


gggctggttt 


tcagatgagt tttctgcttg 


180 


cctgtcatct ggatagtgtc 


taaaaatttg 


caaactgcct 


tcttgtcagt gtcttgctca 


240 


ttcttcatga cactcctgat 


atgtctctca 


gtttcctcat 


ctgctgcctc tccagacttc 


300 


tgccagaaca ttgcacgcga 


cagtttcagg 


cacagaactg 


actggcagca ggggctgctc 


360 


cacgagtggg aatttgctcc 


agcacttcac 


ggactgcaag 


cgaggcactt gctaactctt 


420 


ggataacaag acctctgcca 


gaagaaccat 


ggctttggaa 


ggcggagttc aggctgagga 


480 
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gatgggtgcg gtcctcagtg agcccctgcc tccctgaaca taggaaaccc acctgggcag 540 

ccatggaatg ggacaatggc acaggccagg ctctgggctt gccacccacc acctgtgtct 600 

accgcgagaa cttcaagcaa ctgctgctgc cacctgtgta ttcggcggtg ctggcggctg 660 

gcctgccgct gaacatctgt gtcattaccc agatctgcac gtcccgccgg gccctgaccc 720 

gcacggccgt gtacacccta aaccttgctc tggctgacct gctatatgcc tgctccctgc 780 

ccctgctcat ctacaactat gcccaaggtg atcactggcc ctttggcgac ttcgcctgcc 840 

gcctggtccg cttcctcttc tatgccaacc tgcacggcag catcctcttc ctcacctgca 900 

tcagcttcca gcgctacctg ggcatctgcc acccgctggc cccctggcac aaacgtgggg 960 

gccgccgggc tgcctggcta gtgtgtgtag ccgtgtggct ggccgtgaca acccagtgcc 1020 

tgcccacagc catcttcgct gccacaggca tccagcgtaa ccgcactgtc tgctatgacc 1080 

tcagcccgcc tgccctggcc acccactata tgccctatgg catggctctc actgtcatcg 1140 

gcttcctgct gccctttgct gccctgctgg cctgctactg tctcctggcc tgccgcctgt 1200 

gccgccagga tggcccggca gagcctgtgg cccaggagcg gcgtggcaag gcggcccgca 1260 

tggccgtggt ggtggctgct gcctttgcca tcagcttcct gccttttcac atcaccaaga 132 0 

cagcctacct ggcagtgcgc tcgacgccgg gcgtcccctg cactgtattg gaggcctttg 1380 

cagcggccta caaaggcacg cggccgtttg ccagtgccaa cagcgtgctg gaccccatcc 1440 

tcttctactt cacccagaag aagttccgcc ggcgaccaca tgagctccta cagaaactca 1500 

cagccaaatg gcagaggcag ggtcgctgag tcctccaggt cctgggcagc cttcatattt 1560 

gccattgtgt ccggggcacc aggagcccca ccaaccccaa accatgcgga gaattagagt 1620 

tcagctcagc tgggcatgga gttaagatcc ctcacaggac ccagaagctc accaaaaact 1680 

atttcttcag ccccttctct ggcccagacc cfcgtgggcat ggagatggac agacctgggc 1740 

ctggctcttg agaggtccca gtcagccatg gagagctggg gaaaccacat taaggtgctc 1800 

acaaaaatac agtgtgacgt gtactgtcaa aa 1832 . 

<210> 22 

<211> 2811 

<212> DNA 

<213> Homo Sapiens 

<400> 22 

acacgtccaa cgccagcatg cagcgcccgg gcccccgcct gtggctggtc ctgcaggtga 60 

tgggctcgtg cgccgccatc agctccatgg acatggagcg cccgggcgac ggcaaatgcc 120 

agcccatcga gatcccgatg tgcaaggaca tcggctacaa catgactcgt atgcccaacc 180 

tgatgggcca cgagaaccag cgcgaggcag ccatccagtt gcacgagttc gcgccgctgg 240 
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tggagtacgg 


ctgccacggc 


cacctccgct 


tcttcctgtg 


ctcgctgtac 


gcgccgatgt 


300 


gcaccgagca 


ggtctctacc 


cccatccccg 


cctgccgggt 


catgtgcgag 


caggcccggc 


360 


tcaagtgctc 


cccgattatg 


gagcagttca 


acttcaagtg 


gcccgactcc 


ctggactgcc 


420 


ggaaactccc 


caacaagaac 


gaccccaact 


acctgtgcat 


ggaggcgccc 


aacaacggct 


480 


cggacgagcc 


cacccggggc 


tcgggcctgt 


tcccgccgct 


gttccggccg 


cagcggcccc 


540 


acagcgcgca 


ggagcacccg 


ctgaaggacg 


ggggccccgg 


gcgcggcggc 


tgcgacaacc 


600 


cgggcaagtt 


ccaccacgtg 


gagaagagcg 


cgtcgtgcgc 


gccgctctgc 


acgcccggcg 


660 


tggacgtgta 


ctggagccgc 


gaggacaagc 


gcttcgcagt 


ggtctggctg 


gccatctggg 


720 


cggtgctgtg 


cttcttctcc 


agcgccttca 


ccgtgctcac 


cttcctcatc 


gacccggccc 


780 


gcttccgcta 


ccccgagcgc 


cccatcatct 


tcctctccat 


gtgctactgc 


gtctactccg 


840 


tgggctacct 


catccgcctc 


ttcgccggcg 


ccgagagcat 


cgcctgcgac 


cgggacagcg 


900 


gccagctcta 


tgtcatccag 


gagggactgg 


agagcaccgg 


ctgcacgctg 


gtcttcctgg 


960 


tcctctacta 


cttcggcatg 


gccagctcgc 


tgtggtgggt 


ggtcctcacg 


ctcacctggt 


1020 


tcctggccgc 


cggcaagaag 


tggggccacg 


aggccatcga 


agccaacagc 


agctacttcc 


1080 


acctggcagc 


ctgggccatc 


ccggcggtga 


agaccatcct 


gatccfcggtc 


atgcgcaggg 


1140 


tggcggggga 


cgagctcacc 


ggggtctgct 


acgtgggcag 


catggacgtc 


aacgcgctca 


1200 


ccggcttcgt 


gctcattccc 


ctggcctgct 


acctggtcat 


cggcacgtcc 


ttcatcctct 


1260 


cgggcttcgt 


ggccctgttc 


cacatccgga 


gggtgatgaa 


gacgggcggc 


gagaacacgg 


1320 


acaagctgga 


gaagctcatg 


gtgcgtatcg 


ggctcttctc 


tgtgctgtac 


accgtgccgg 


1380 


ccacctgtgt 


gatcgcctgc 


tacttttacg 


aacgcctcaa 


catggattac 


tggaagatcc 


1440 


tggcggcgca 


gcacaagtgc 


aaaatgaaca 


accagactaa 


aacgctggac 


tgcctgatgg 


1500 


ccgcctccat 


ccccgccgtg 


gagatcttca 


tggtgaagat 


ctttatgctg 


ctggtggtgg 


1560 


ggatcaccag 


cgggatgtgg 


atttggacct 


ccaagactct 


gcagtcctgg 


cagcaggtgt 


1620 


gcagccgtag 


gttaaagaag 


aagagccgga 


gaaaaccggc 


cagcgtgatc 


accagcggtg 


1680 


ggatttacaa 


aaaagcccag 


catccccaga 


aaactcacca 


cgggaaatat 


gagatccctg 


1740 


cccagtcgcc 


cacctgcgtg 


tgaacagggc 


tggagggaag 


ggcacagggg 


cgcccggagc 


1800 


taagatgtgg 


tgcttttctt 


ggttgtgttt 


ttctttcttc 


ttcttctttt 


tttttttttt 


1860 


ataaaagcaa 


aagagaaata 


cataaaaaag 


tgtttaccct 


gaaattcagg 


atgctgtgat 


1920 


acactgaaag 


gaaaaatgta 


cttaaagggt 


tttgttttgt 


tttggttttc 


cagcgaaggg 


1980 


aagctcctcc 


agtgaagtag 


cctcttgtgt 


aactaatttg 


tggtaaagta 


gttgattcag 


2040 


ccctcagaag 


aaaacttttg 


tttagagccc 


tccgtaaata 


tacatctgtg 


tatttgagtt 


2100 
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ggctttgcta 


cccatttaca 


aataagagga 


cccctgggtt 


aacaaatgag 


ccatccccag 


gcggcatccc 


tgcagaggaa 


agacaggacc 


gagttgctta 


aaatagactc 


tggccttcac 


atcaaacctc 


acatttgtga 


actcaaacga 


aaataaaaag 


agaaacaagt 


attttgctat 


aaaagaacta 


agaggcccag 


ccctcagaaa 


atggaaacca 


agccaatgtt 


atagacgttt 


gggagaagga 


tcattcaaaa 


gttacccaaa 


aatgatttcc 


acaaacagat 


caggaagcac 


ttctcttcac 


agtgccagga 


aagagtggtt 


tttttcatgc 


tccactattt 


tattaaaaat 


<210> 23 

<211> 2010 

<212> PNA 

<213> Homo sapiens 




<400> 23 
ggcagcgact 


gcgccccgtc 


ccgqcgccqc 


ggcagctgcg 


gctcgggatc 


cqtcqaqqqq 


gggtcatggt 


gcccggcgcc 


cqcqqcqqcq 


tcctggcttt 


gctgctcgcg 


gtctccgccc 


atggctgtgg 


acacctagtg 


acttatcagg 


ccgggaccta 


ccccaatcac 


actgtttgcg 


gactgattct 


gaggttggga 


gatttggata 


ttctcttcac 


cagctcttca 


gatcaatatg 


aagaactctt 


gttgaacaca 


agtgaagtaa 


ctggccgggg 


ttttttgctg 


acctatgcga 


tggaacgagc 


tagccattat 


ttgaagacag 


gagacgtagc 


aggagacatt 


tctgggaata 


tgtgcaaagc 


tgccatccat 


gcaggaataa 


tgcttcagcg 


caaagggatc 


agtcgatatg 


gggatggttc 


cctgtcagac 


aagcgatttc 



cagataactg 


ctttgcaaat 


tcaagagcct 


2160 


ggcccacccc 


caggaaggcc 


acaqtqctgg 


2220 


c 9999cccgc 


ctcacacccc 


agtggatttg 


2280 


caatagtctc 


tctgcaagac 


agaaacctcc 


2340 


tgtgcaatac 


atttttttct 


ct ttccttga 


2400 


atataaagac 


aacaaaagaa 


atctcr^aac 

\^ \* V* \^ %^ V-o 


2460 


cccttcagtg 


ctacattttg 


taactt t*ttra 

^-j V** t^. \* I— C-* 


2520 


ggactgattt 


gtggaaagga 


ggggggaaga 


2580 


gggcttattg 


actctttcta 


ttgttaaaca 


2640 


taggttggca 


gagacacttt 


gtctagtgta 


2700 


tctgcgtgtg 


tatatttgta 


atatatgata 


2760 


aaaatatgtt 


ctttaaaaaa 


a 


2811 



gctcgtccgc 


agaggaggcg 


gcccggcccg 


60 


aggccgagct 


tgccaagctg 


gcgcccagcg 


120 


gcgcactggc 


gcgggctgcc 


gggcggggcc 


180 


cgctccggct 


gcaggcggag 


gagctgggtg 


240 


atagtggcac 


aatgacatct 


aagaattatc 


300 


aaaagacaat 


tacagtacca 


aaggggaaaa 


360 


tcgaatccca 


gacctgtgct 


tctgactatc 


420 


gtccatactg 


tggaagtatg 


actgttccca 


480 


ccgtccgctt 


tgagagtgga 


tcccacattt 


540 


gcagcgacca 


tccagattta 


ataacatgtt 


600 


aatacagcaa 


attctgccca 


gctggttgta 


660 


tggtagatgg 


atatagagat 


acctctttat 


720 


ttgctgatga 


actaggtggc 


cagatcagtg 


780 


aagggattct 


ggccaatggt 


gttctttcga 


840 


tgtttacctc 


caatggttgc 


agcagatcct 


900 
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tgagttttga 


acctgacggg 


caaatcagag 


cttcttcctc 


atggcagtcg 


gtcaatgaga 


960 


gtggagacca 


agttcactgg 


tctcctggcc 


aagcccgact 


tcaggaccaa 


ggcccatcat 


1020 


gggcttcggg 


cgacagtagc 


aacaaccaca 


aaccacgaga 


gtggctggag 


atcgatttgg 


1080 


gggagaaaaa 


gaaaataaca 


ggaattagga 


ccacaggatc 


tacacagtcg 


aacttcaact 


1140 


tttatgttaa 


gagttttgtg 


atgaacttca 


aaaacaataa 


ttctaagtgg 


aagacctata 


1200 


aaggaattgt 


gaataatgaa 


gaaaaggtgt 


ttcagggtaa 


ctctaacttt 


cgggacccag 


1260 


tgcaaaacaa 


tttcatccct 


cccatcgtgg 


ccagatatgt 


gcgggttgtc 


ccccagacat 


1320 


ggcaccagag 


gatagccttg 


aaggtggagc 


tcattggttg 


ccagattaca 


caaggtaatg 


1380 


attcattggt 


gtggcgcaag 


acaagtcaaa 


gcaccagtgt 


ttcaactaag 


aaagaagatg 


1440 


agacaatcac 


aaggcccatc 


ccctcggaag 


aaacatccac 


aggaataaac 


attacaaegg 


1500 


tggctattcc 


attggtgctc 


cttgttgtcc 


tggtgtttgc 


tggaatgggg 


atetttgeag 


1560 


cctttagaaa 


gaagaagaag 


aaaggaagtc 


cgtatggatc 


agcagaggct 


cagaaaacag 


1620 


actgttggaa 


gcagattaaa 


tatccctttg 


ccagacatca 


gtcagctgag 


tttaccatca 


1680 


gctatgataa 


tgagaaggag 


atgacacaaa 


agttagatct 


catcacaagt 


gatatggcag 


1740 


gttaactccg 


ttgactgcca 


aaatagcatc 


cccaacgtgc 


agccctccgc 


atctatcagc 


1800 


aggttgcccc 


ggatggatct 


cagagatgag 


gatcggaaca 


ccatgttctt 


tcccacccta 


1860 


acaacaacaa 


agggcagtaa 


attaaagtac 


tctttgtaag 


gtacagttac 


cgattaatct 


1920 


agagataaaa 


tattttctta 


aaaatatatt 


tcattaaaca 


c c t at gc tg t 


ctctataaaa 


1980 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 








2010 


<210> 24 

<211> 2010 

<212> DNA 

<213> Homo sapiens 












<400> 24 

ggcagcgact gcgccccgtc 


ccggcgccgc 


gctcgtccgc 


agaggaggcg 


gcccggcccg 


60 


ggcagctgcg gctcgggatc 


cgtcgagggg 


aggccgagct 


tgccaagctg 


gcgcccagcg 


120 


gggtcatggt gcccggcgcc 


cgcggcggcg 


gcgcactggc 


gcgggct qcc 


qqqcqqqqcc 


180 


tcctggcttt gctgctcgcg 


gtctccgccc 


cgctccggct 


geaggeggag 


gagcfcgggtg 


240 


atggctgtgg acacctagtg 


acttatcagg 


atagtggcac 


aatgacatct 


aagaafcfcatc 


300 


ccgggaccta ccccaatcac 


actgtttgcg 


aaaagacaat 


tacagtacca 


aaggggaaaa 


360 


gactgattct gaggttggga 


gatttggata 


tcgaatccca 


gacctgtgct 


tctgactatc 


420 


ttctcttcac 


cagctcttca 


gatcaatatg 


gtccatactg 


tggaagtatg 


actgttccca 


480 
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aagaactctt gttgaacaca 


agtgaagtaa 


ccgtccgctt 


tgagagtgga 


tcccacattt 


540 


ctggccgggg ttttttgctg 


acctatgcga 


gcagcgacca 


tccagattta 


ataacatgtt 


600 


tggaacgagc 


tagccattat 


ttgaagacag 


aatacagcaa 


attctgccca 


gctggttgta 


660 


gagacgtagc 


aggagacatt 


tctgggaata 


tqqtaqatqq 


atatagagat 


acctctttat 


720 


tgtgcaaagc 


tgccatccat 


gcaggaataa 


ttgctgatga 


actaqqtqqc 


caqatcaqta 


780 


tgcttcagcg caaagggatc 


agtcgatatg 


aagggattct 


qqccaatqqt 


gttctttcga 


840 


gggatggttc 


cctgtcagac 


aagcgatfcfcc 


fcafchtaccfcc 

w \-4 L_ I* W w 


raataattac 






tgagttttga 


acctgacggg 


caaatcagag 


ctfccttccfcc 


ataacaatcQ 


otcaataaoa 




gtggagacca 


agttcactgg 


tctcctggcc 


aagcccgact 


tcaggaccaa 


aocccatcal" 

^ ^— w Li. v^- el i— 


1 09 Ci 


gggcttcggg 


cgacagtagc 


aacaaccaca 


aaccacgaga 


qtqqctqqaq 


atcqatttaa 


1080 


gggagaaaaa 


gaaaataaca 


ggaattagga 


ccacaggatc 


tacacagtcg 


aacttcaact 


1140 


tttatgttaa 


gagttttgtg 


atgaacttca 


aaaacaataa 


ttctaagtgg 


aagacctata 


1200 


aaggaattgt 


gaataatgaa 


gaaaaggtgt 


ttcagggtaa 


ctctaacttt 


CQqqacccacr 


1260 


tgcaaaacaa 


tttcatccct 


cccatcgtgg 


ccagatatgt 


qcqqqttqtc 


ccccagacat 


1320 


ggcaccagag 


gatagccttg 


aaqqtqqaqc 

3-3 w 3 3 3 


tcattqqttq 


ccagattaca 


caaaataata 


1380 




y tgy cy c aag 


acaagtcaaa 


gcaccagtgt 


ttcaactaao 


aaaaaacfat"cr 
cic*c*y rci^u ^*y 


1440 


agacaatcac 


aaggcccatc 


ccctcggaag 


aaacafc ccac 




,3 f- = pa o CCtn 


xouv 


tggctattcc 


attggtgctc 


cfctgttgtcc 


tqatatthac 

^" 3 3 wyuwu-yu 


t* era a a t cracicr 


a t~ r -1 f~ +~ t~ ryr* a rr 
a *w <»- c» iw- y w cx y 


i Rfin 


cctttagaaa 


gaagaagaag 


aaaggaagtc 


catataaatc 




v«y clci densely 


1 

J-D^i u 


actgttggaa 


gcagattaaa 


tatccctttg 


ccaaacatca 

w» t* fc* W w CI 


atcaactaaa 




lOOU 


gctatgataa 


tgagaaggag 


atgacacaaa 


agttagatct 


catcacaagt 


aatataacaa 


1740 


gttaactccg 


ttgactgcca 


aaatagcatc 


cccaacgtgc 


agccctccgc 


atctatcagc 


1800 


aggttgcccc 


ggatggatct 


cagagatgag 


gatcggaaca 


ccatgttctt 


tcccacccta 


1860 


acaacaacaa 


agggcagtaa 


attaaagtac 


tctttgtaag 


gtacagttac 


cgattaatct 


1920 


agagataaaa 


tattttctta 


aaaatatatt 


tcattaaaca 


ectatgetgt 


ctctataaaa 


1980 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 








2310 



<210> 25 

<211> 1159 

<212> DNA 

<213> Homo Sapiens 



<400> 25 

agtgccccag gagctatgac aagcaaagga acatacttgc ctggagatag ectttgegat 



60 
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atttaaatgt 


ccgtggatac 


agaaatctct 


gcaggcaagt 


tgctccagag 


catattgcag 


120 


gacaagcctg 


taacgaatag 


ttaaattcac 


ggcatctgga 


ttcctaatcc 


ttttccgaaa 


180 


tggcaggtgt 


gagtgcctgt 


ataaaatatt 


ctatgtttac 


cttcaacttc 


ttqttctqqc 


240 


tatgtggtat 


cttgatccta 


gcattagcaa 


tatgggtacg 


agtaagcaat 


gactctcaag 


300 


caatttttgg 


ttctgaagat 


gtaggctcta 


gctcctacgt tgctgtggac 


a tat taafcf-a 




ctqtaqqtqc 


catcatcatg 


attctcrcicrct 


tcctgggatg 


ctgcggtgct 




*i ^ U 


gtcgctgcat 


gcttctgttg 


tttttcatag 


gcttgcttct gatcctgctc 




/I fi n 


cgacaggtat 


cctaggagct 


gttttcaaat 


ctaagtctga 


tcgcattgtg 




c a n 


tctatgaaaa 


cacaaagctt 


ttgagcgcca 


caggggaaag 


tgaaaaacaa 


ttccaaoaaa 

w wl y Q, ciy 




ccataattgt 


gtttcaagaa 


gagtttaaat 


gctgcggttt 


ggtcaatgga 




u o v 


ggggaaataa 


ttttcaacac 


tatcctgaat 


tatgtgcctg 


tctagataag 


cagagaccat 


720 


gccaaagcta 


taatggaaaa 


caagtttaca 


aagagacctg 


tatttctttc 


ataaaagact 


780 


tcttggcaaa 


aaatttgatt 


atagttattg 


gaatatcatt 


tggactggca 


attattaaaa 


840 


tactgggttt 


ggtgttttct 


atggtcctgt 


attgccagat 


cgggaacaaa 


taaafcrtatcr 


900 


gatgcatcaa 


cctatcgtca 


gtcaaacccc 


tttaaaatgt 


tgctttggct 


ttgtaaattt 


960 


aaatatgtaa 


gtgctatata 


agtcaggagc 


agctgtcttt ttaaaatgtc 


tcggctagct 


1020 


agaccacaga 


tatcttctag 


acatattgaa 


cacatttaag 


atttgaggga 


tataagggaa 


1080 


aatgatatga 


atgtgtattt 


ttactcaaaa 


taaaagtaac 


tgtttacgtt 


aaaaaaaaaa 


1140 


aaaaaaaaaa 


aaaaaaaaa 










1159 



<210> 26 

<211> 1428 

<212> DNA 

<213> Homo Sapiens 

<400> 26 



cttcaggtca gggagaatgt 


ataaatgtcc 


attgccatcg aggttctgct 


atttttgaga 


60 


agctgaagca actccaagga 


cacagttcac 


agaaatttgg ttctcagccc 


caaaatactg 


120 


attgaattgg agacaattac 


aaggactctc 


tggccaaaaa cccttgaaga 


ggccccgtga 


180 


aggaggcagt gaggagcttt 


tgattgctga 


cctgtgtcgt accaccccag 


aatgtgcact 


240 


gggggctgtg ccagatgcct 


gggggggacc 


ctcattcccc ttgctttttt 


tggcttcctg 


300 


gctaacatcc tgttattttt 


tcctggagga 


aaagtgatag atgacaacga 


ccacctttcc 


360 


caagagatct ggtttttcgg 


aggaatatta 


ggaagcggtg tcttgatgat 


cttccctgcg 


420 


ctggtgttct tgggcctgaa 


gaacaatgac 


tgctgtgggt gctgcggcaa 


cgagggctgt 


480 
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gggaagcgat 


ttgcgatgtt 


cacctccacg 


atatttgctg 


tggttggatt cttgggagct 


540 


ggatactcgt 


ttatcatctc 


agccatttca 


atcaacaagg 


gtcctaaatg cctcatggcc 


600 


aatagtacat 


ggggctaccc 


cttccacgac 


Qqqqattatc 


tcaatgatga ggccttatgg 


660 


aacaagtgcc 


gagagcctct 


caatgtggtt 


ccctggaatc 


tgaccctctt ctccatcctg 


720 


ctggtcgtag 


gaggaatcca 


gatggttctc 


tgcgccatcc 


aggtggtcaa tggcctcctg 


780 


gggaccctct 


qtaaqqactQ 


ccagtgttgt 




crcrcrcracrataof accccrtttaa 


840 


acctccgaga 


tgagctgctc 


agactctaca 


acatciacqac 


tacaatttct tttcataaaa 


900 


cttcttctct 


tcttggaatt 


attaattcct 


atctgcttcc 


tagctgataa agcttagaaa 


960 


aggcagttat 


tccttctttc 


caaccagctt 


tgctcgagtt 


agaattttgt tattttcaaa 


1020 


taaaaaatag 


tttggccact 


taacaaattt 


gatttataaa 


tctttcaaat tagttccttt 


1080 


ttagaattta 


ccaacaggtt 


caaagcatac 


ttttcatgat 


ttttttatta caaatgtaaa 


1140 


atgtataaag 


tcacatgtac 


tgccatacta 


cttctttgta 


tataaagatg tttatatctt 


1200 


tggaagtttt 


acataaatca 


aaggaagaaa 


gcacatttaa 


aatgagaaac taagaccaat 


1260 


ttctgttttt- 


aagaggaaaa 


agaatgattg 


atgtatccta 


agtattgtta tttgttgtct 


1320 


ttttttgctg 


ccttgcttga 


gttgcttgtg 


actgatcttt 


tgaggctgtc atcatggcta 


1380 


gggttctttt 


atgtatgtta 


aattaaaacc 


tgaattcaga 


ggtaacgt 


1428 



<210> 27 

<211> 2454 

<212> DNA 

<213> Homo Sapiens 

<400> 27 



ccaggaggca 


cgctggtttt 


ccggggccgc 


tccatcgcgc 


cttcctcctg cgcctcgctt 


60 


ctccggtcca 


gccgccatct 


tcctttccgc 


acaggggccg 


ccgagcgggg ccatgcagcc 


120 


aacgctgctt 


ctcagcctcc 


tgggagccgt 


ggggctggcg 


gctgtcaatt ccatgccagt 


180 


ggataacagg 


aaccacaatg 


aaggaatggt 


gactcgctgc 


atcattgagg tcctctcaaa 


240 


tgccttgtcg 


aagtccagcg 


ctccacccat 


cacccctgag 


tgccgccaag tcctgaagac 


300 


gagtagaaaa 


gacgtcaaag 


acaaagagac 


aactgaaaat 


gaaaacacaa agtttgaagt 


360 


aagattgtta 


agagacccag 


ctgatgcctc 


ggaagcccac 


gagtcctcca gcaggggaga 


420 


ggcaggagcc 


ccaggggagg 


aggacatcca 


aggcccaaca 


aaggcagaca cagagaaatg 


480 


ggcagaggga 


ggcgggcaca 


gccgagagcg 


agcggatgag 


ccccagtgga gcctctatcc 


540 


ctccgacagc 


caagtctctg 


aagaagtgaa 


gacacgccat 


tctgagaaga gccagagaga 


600 


ggatgaggag 


gaggaggagg 


gagagaacta 


tcaaaaaggg 


gagcgagggg aagatagcag 


660 
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tgaagagaaa 


caccttgaag 


agccaggaga 


gacacaaaac 


gcttttctca 


atgaaagaaa 


720 


gcaggcttca 


gctataaaaa 


aagaggagtt 


agtggccaga 


tcggaaacac 


atgctgccgg 


780 


gcattctcag 


gagaagacac 


atagccgaga 


gaagagtagc 


caggagagtg 


gagaggaggc 


840 


agggagccag 


gagaatcacc 


cccaggagtc 


taaaggccaa 


ccccgaagcc 


aggaagaatc 


900 


tgaggaaggt 


gaggaagatg 


ccacctctga 


ggtggacaaa 


cgacgcacga 


ggcccagaca 


960 


ccaccacggg 


aggagcaggc 


ccgacaggtc 


ctctcaagga 


gggagtcttc 


cctctgagga 


1020 


aaagggacac 


ccccaggagg 


aatctgagga 


gtcaaacgtc 


agcatggcca 


gtttagggga 


1080 


aaagagggac 


caccattcaa 


cccactacag 


ggcttcagag 


gaagaacctg 


aatatggaga 


1140 


agaaataaag 


ggttatccag 


gcgtccaggc 


ccctgaggac 


ctggagtggg 


agcgctatag 


1200 


gggcagagga 


agtgaagaat 


acagggctcc 


aagacctcag 


agtgaggaga 


Qttgggatga 


1260 


ggaggacaag 


agaaactacc 


ccagcttaga 


gcttgataag 


atggcacatg 


gatatggtga 


1320 


agaaagtgag 


gaagagaggg 


gccttgagcc 


gggaaaggga 


cgccatcaca 


gaggcagggg 


1380 


aggggagcca 


cgtgcctatt 


tcatgtctga 


caccagagaa 


gagaaaaggt 


tcttgggtga 


1440 


aggacaccac 


cgtgtccaag 


aaaaccagat 


ggacaaggca 


aggaggcatc 


cacaaggtgc 


1500 


gtggaaagag 


ctggacagaa 


attatctcaa 


ctacggtgag 


gaaggagccc 


cagggaagtg 


1560 


gcagcagcag 


ggagacctgc 


aggacactaa 


agaaaacagg 


gaggaagcta 


ggtttcaaga 


1620 


taaacaatat 


agctcccatc 


acacagctga 


aaagaggaag 


agattagggg 


aactgttcaa 


1680 


cccatactac 


gaccctctcc 


agtggaagag 


cagccatttt 


gaaagaagag 


acaacatgaa 


1740 


tgacaatttt 


ctcgagggtg 


aggaggaaaa 


tgagctgacc 


ttgaacgaga 


agaatttctt 


1800 


cccagaatac 


aactatgact 


ggtgggagaa 


aaagcccttc 


tctqaggatg 


tgaactqqqg 


1860 


gtatgagaag 


agaaacctcg 


ccagggtccc 


caagctggac 


ctgaaaaggc 


aatatgacag 


1920 


ggtggcccaa 


ctggaccagc 


tccttcacta 


caggaagaag 


tcagctgagt 


ttccagactt 


1980 


ctatgattct 


gaggagccgg 


tgagcaccca 


ccaggaggca 


gaaaatgaaa 


aggacagggc 


2040 


tgaccagaca 


gtcctgacag 


aggacgagaa 


aaaagaactc 


gaaaacttgg 


ctgcaatgga 


2100 


tttggaacta 


cagaagatag 


ctgagaaatt 


cagccaaagg 


ggctgactgt 


cattggagcg 


2160 


gtgggcactg 


ttaagaagca 


gccatcacat 


gatctgtttt 


tcaccacttc 


actgaaagac 


2220 


accatttata 


tacccaaggg 


cagaaagtag 


aacttactat 


tcattaaatg 


tttgacacaa 


2280 


ttggaattgt 


ctttaatttc 


tgtcagaatg 


ctattgaaaa 


tgtgaattgc 


atgacttgta 


2340 


gcatattctt 


ttctgcaaaa 


tagacatatt 


aacatgctta 


tgacaatgac 


tgtgctactg 


2400 


tctttggaaa 


aatgtttgtc 


tcagttggaa 


ataataaaag 


attcacctga 


gacc 


2454 
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<210> 28 

<211> 1980 

<212> DNA 

<213> Homo Sapiens 

<400> 28 



cttcttgtgg 


tagggacctc 


tcctcagtat ttgaaactaa 


ccagcatctg acagatttcg 


60 


aatttgtaaa 


aaataccctc 


gaagattcag gaatgaagct 


tctgtgtgaa ggattaaaac 


120 


agcccaactg 


tgtattacag 


acattgaggt ggtaccggtg 


ccttatctct tctgcttctt 


180 


gtggggctct 


agcagctgtt 


cttagcacca gtcagtggct 


cactgaactg gaatttagtg 


240 


agacaaaact 


ggaagcttca 


gctttgaaat tgctctatgg 


aggcttaaaa gatccaaatt 


300 


gcaaattaca 


gaagctcaac 


ttgcagtttt ctttatctgt 


aaccgctgca aaacttccag 


360 


ttggaatggt 


tggaaattgt 


tctggtttct cgggatcatt 


ggtgcaatct cattttggct 


420 


actgtcagga 


cagttctttc 


aaatgtgatc tttgtaagct 


gctctggcct tccaccagag 


480 


ttgctgctgc 


aaaggattgt 


gggagtccta agtccttcct 


atcagaaggg ctgaactggg 


540 


caggaagact 


tgaggcagtg 


gaggaggttt tggggttggg 


ggtgcttgta cagcccqqtq 


600 


acccagcatc 


tcagggtggg 


gggcattgtg aaaactatgg 


gtcttttaga gacttggtgg 


660 


acttagaagt 


caaggcagaa 


ccaagcctga gaaaaggtgg 


tatQoatctc caaaaaccca 


720 


ccctacaagt 


tgtcctcctt 


tgcaaaatct tctccctcaa 


actatttctc tttattgcat 


780 


tgcctaattc 


tcctggtcag 


gttagtgtgg tgcaagtgac 


catcccagac ggtttcgtga 


840 


acgtgactgt 


tggatctaat 


gtcactctca tctgcatcta 


caccaccact gtggcctccc 


900 


gagaacagct 


ttccatccag 


tggtctttct tccataagaa 


ggagatggag ccaatttctt 


960 


ctccttggga 


ggaggggaag 


tggccagatg ttgaggctgt 


gaagggcact cttgatggac 


1020 


agcaggctga 


actccagatt 


tacttttctc aaggtggaca 


agctgtagcc atcgggcaat 


1080 


ttaaagatcg 


aattacaggg 


tccaacgatc caggtaatgc 


atctatcact atctcgcata 


1140 


t.ycciycca.yc 


ayacaytyyo 


attcacaccn gcgacgccaa 


caacccccca gactttctcg 


1200 


gccaaaacca 


aggcatcctc 


aacgtcagtg tgttagtgaa 


accttctaag cccctttgta 


1260 


gcgttcaagg 


aagaccagaa 


actggccaca ctatttccct 


ttcctgtctc tctgcgcttg 


1320 


gaacaccttc 


ccctgtgtac 


tactggcata aacttgaggg 


aagagacatc gtgccagtga 


1380 


aagaaaactt 


caacccaacc 


accgggattt tggtcattgg 


aaatctgaca aattttgaac 


1440 


aaggttatta 


ccagtgtact 


gccatcaaca gacttggcaa 


tagttcctgc gaaatcgatc 


1500 


tcacttcttc 


acatccagaa 


gttggaatca ttgttggggc 


cttgattggt agcctggtag 


1560 


gtgccgccat 


catcatctct 


gttgtgtgct tcgcaaggaa 


taaggcaaaa gcaaaggcaa 


1620 
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aagaaagaaa 


ttctaagacc 


atcgcggaac ttgagccaat gacaaagata aacccaaggg 


1680 


gagaaagcga 


agcaatgcca 


agagaagacg ctacccaact agaagtaact ctaccatctt 


1740 


ccattcatga gactggccct 


CJrit" 3CP31" ff A^OflSfTflfia ri"af*niinppa aan^nharii-^ 


1800 


aggagcctgc 


cccagagcct 


gccccaggat cagagcctat ggcagtgcct gaccttgaca 


I860 


tcgagctgga gctggagcca 


gaaacgcagt cggaattgga gccagagcca gagccagagc 


1920 


cagagtcaga gcctggggtt 


gtagttgagc ccttaagtga agatgaaaag ggagtggtta 


1980 


<210> 29 

<2ll> 1242 

<212> DNA 

<213> Homo Sapiens 






<400> 29 
atggtgttcg 


cattttggaa 


ggtctttctg atcctaagct gccttgcagg tcaggttagt 


60 


gtggcgcaag 


tgaccatccc 


agacggtttc gtgaacgtga ctgttggatc taatgtcact 


120 


c tcaucxzyca 


fcctacaccac 


cactgtggcc tcccgagaac agctttccat ccagtggtct 


180 


fctcfcfcccata 


agaaggagafc 


ggagccaatt tcttctcctt gggaggaggg gaagtggcca 


240 


gacgctgagg 


ctgtgaaggg 


cactcttgat ggacagcagg ctgaactcca gatttacttt 


300 


i-cccaaggcg 


gacaagctgt 


agccatcggg caatttaaag atcgaattac agggtccaac 


360 


gatccagg ta 


acgcacccac 


cactatctcg catatgcagc cagcagacag tggaatttac 


420 


auctgcgac-g 


ttaacaaccc 


cccagacttt ctcggccaaa accaaggcat cctcaacgtc 


480 


agtgtgttag 


tgaaaccttc 


taagcccctt tgtagcgttc aaggaagacc agaaactggc 


540 


cacactattt 


ccctttcctg 


tctctctgcg cttggaacac cttcccctgt gtactactgg 


600 


cataaacttg 


agggaagaga 


catcgtgcca gtgaaagaaa acttcaaccc aaccaccggg 


660 


attttggtca 


ttggaaatct 


gacaaatttt gaacaaggtt attaccagtg tactgccatc 


720 


aacagacttg 


gcaatagttc 


ctgcgaaatc gatctcactt cttcacatcc agaagttgga 


780 


atcattgttg 


gggccttgat 


tggtagcctg gtaggtgccg ccatcatcat ctctgttgtg 


840 


tgcttcgcaa 


ggaataaggc 


aaaagcaaag gcaaaagaaa gaaattctaa gaccatcgcg 


900 


gaacttgagc 


caatgacaaa 


gataaaccca aggggagaaa gcgaagcaat gccaagagaa 


960 


gacgctaccc 


aactagaagt 


aactctacca tcttccattc atgagactgg ccctgatacc 


1020 


atccaagaac 


cagactatga 


gccaaagcct actcaggagc ctgccccaga gcctgcccca 


1080 


ggatcagagc 


ctatggcagt 


gcctgacctt gacatcgagc tggagctgga gccagaaacg 


1140 


cagtcggaat 


tggagccaga 


gccagagcca gagccagagt cagagcctgg ggttgtagfct 


1200 


gagcccttaa 


gtgaagatga 


aaagggagtg gttaaggcat ag 


1242 
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<210> 30 

<211> 1451 

<212> DNA 

<213> Homo Sapiens 

<400> 30 



cggcccgccc 


tggggaggcg 


cgcagcagag gctccgattc 


ggggcaggtg 


agaggctgac 


60 


tttctctcgg 


tgcgtccagt 


ggagctctga gtttcgaatc 


ggtggcggcg gattccccgc 


120 


gcgcccggcg 


tcggggcttc 


caggaggatg cggagcccca 


gcgcggcgtg gctgctgggg 


180 


gccgccatcc 


tgctagcagc 


ctctctctcc tgcagtggca 


ccatccaagg 


aaccaataga 


240 


tcctctaaag 


gaagaagcct 


tattggtaag gttgatggca 


catcccacgt 


cactggaaaa 


300 


ggagttacag 


ttgaaacagt 


cttttctgtg gatgagtttt 


ctgcatctgt 


cctcactgga 


360 


aaa'ctgacca 


cggtcttcct 


tccaattgtc tacacaattg 


tgtttgtggt 


gggtttgcca 


420 


agtaacggca 


tggccctgtg 


ggtctttctt ttccgaacta 


agaagaagca 


ccctgctgtg 


480 


atttacatgg 


ccaatctggc 


cttggctgac ctcctctctg 


tcatctggtt 


ccccttgaag 


540 


attgcctatc 


acatacatgc 


caacaactgg atttatgggg 


aagctctttg 


taatgtgctt 


600 


attggctttt 


tctatggcaa 


catgtactgt tccattctct 


tcatgacctg 


cctcagtgtg 


660 


cagaggtatt 


gggtcatcgt 


gaaccccatg gggcactcca 


ggaagaaggc 


aaacattgcc 


720 


attggcatct 


ccctggcaat 


atggctgctg attctgctgg 


tcaccatccc 


tttgtatgtc 


780 


gtgaagcaga 


ccatcttcat 


tcctgccctg aacatcacga 


cctgtcatga 


tgttttgcct 


840 


gagcagctct 


tggtgggaga 


catgttcaat tacttcctct 


ctctggccat 


tggggtcttt 


900 


ctgttcccag 


ccttcctcac 


agcctctgcc tatgtgctga 


tgatcagaat 


gctgcgatct 


960 


tctgccatgg 


atgaaaactc 


agagaagaaa aggaagaggg 


ccatcaaact 


cattgtcact 


1020 


yCCCLyyCCa 




cfcgcttcacfc cctagtaacc 


ttctgcttgt 


ggtgcattat 


1080 


tttctgatta 


agagccaggg 


ccagagccat gtctatgccc 


tgtacattgt 


agccctctgc 


1140 


ctctctaccc 


ttaacagctg 


catcgacccc tttgtctatt 


actttgtttc 


acatgatttc 


1200 


agggatcatg 


caaagaacgc 


tctcctttgc cgaagtgtcc 


gcactgtaaa gcagatgcaa 


1260 


gtatccctca 


cctcaaagaa 


acactccagg aaatccagct 


cttactcttc 


aagttcaacc 


1320 


actgttaaga 


cctcctattg 


agttttccag gtcctcagat 


gggaattgca 


cagtaggatg 


1380 


tggaacctgt 


ttaatgttat 


gaggacgtgt ctgttatttc 


ctaatcaaaa 


aggtctcacc 


1440 


acataccacc 


9 








1451 



<210> 31 
<211> 5115 
<212> DNA 
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<213> Homo Sapiens 




<400> 31 
gaattccggg 


a 9cgggcggg 


ctgcgaggcc 


ccgggtggct 


gcgcccattc 


cacacccgcc 


ccttttagga 


99 a 999 a 999 


ggaaaaggtg 


gagatcgcgg 


gtcagctttg 


cagtcgctgc 


cttcctgctg 


ggcacaggcg 


agcgctttat 


cttttcttct 


cctcaacatc 


tgaatcatgc 


tttccgtggt 


ggctcagctc 


cttaactttg 


caggcccggt 


tcgcttcccg 


gacaggaggc 


accacgtggt 


gggtccagtc 


cgagtagatg 


actatcccat 


cacgagcagc 


a 99 a 99 aa 9 a 


acfcacagaafc 


t'tctcacgag 


gagaaggacc 


ctctttccaa 


uagccacacc 


atggagaaga 


tggcttcctc 


tgcccccctc 


tgccatctca 


ttgggacggc 


agccctcagt 


gcctgccatg 


gagacttttt 


cattgaaccc 


gtgaagaagc 


acatcgttta 


caggaggcag 


aaagttccag 


acagtgttaa 


catctcccag 


aagcaagagc 


tgccaagcag 


aagcctctct 


cggcgttcca 


tggtggccga 


cacaaagatg 


attgaatacc 


tcaccatcat 


gaacatggtc 


a ctgggttgt 


acatngtcgc 


ggttcggctc 


attctactcg 


accatgcaga 


aaagacactg 


tctagcttct 


gtgacctcaa 


ucctgttcat 


cacgacgtgg 


ctggtttcaa 


tcgcccctgc 


gagaccctgg 


ctcaccgcag 


ttgtaacatc 


aatgaagatt 


atgagctagg 


acacagcttc 


ggcatccagc 


tgggcagaca 


tccgtacatc 


atgtcccgcc 


ggtccaagtg 


cagcgaggag 


tacatcaccc 


ttgatgacat 


acctaaaaag 


aaaggcttga 
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gcggggcatg 


cgggaggcgg 


a 9999tggga 


60 


gaaagcggac 


actgtcagct 


gaatcactcc 


120 


tctagctaat 


ttctgcttaa 


aaaagcacag 


180 


cttctcgcgc 


ctgaccatgc 


acccctgcat 


240 


ttctqqaqct 

3 3 3 


gagggctaaa 


acttttttca 


300 


catgtgccca 


qaqqaqctqq 


cttgcaaacc 


360 


gggcgctttg 


ctatgggaga 


cagcctcagc 


420 


aagagcattt 


tatcaagggc 


ctgccagaat 


480 


ccagtgggca 


ttttttgtca 


tatggcttgc 


540 


gagatttgga 


tggctcagag 


gactgggtgt 


600 


tgttttttaa 


cttgacggtc 


aatcaaggat 


660 


gatatgggaa 


cctctcccat 


gttaagatga 


720 


gtggcacggt 


tctacagcag 


ggcaccagag 


780 


gactgactgg 


atttttccaa 


ctaccacatg 


840 


atccactggt 


tgagggaggg 


taccacccgc 


900 


aaaccaagga 


gccaacctgt 


ggattaaagg 


960 


tatqqcqqqa 


qaaqtqqqaq 

3 3 333 3 


aggcacaact 


1020 


tcagcaagga 


qaqatqqqtq 

33333 3 


qaqacactqq 


1080 


atgggagtga 


gaatgtggag 


tcctacatcc 


1140 


tccataaccc 


aagcattggc 


aatgcaattc 


1200 


aagaagaaga 


gcaaggactg 


aaaatagttc 


1260 


gcaagtggca 


gaagagtatc 


aatcccaaga 


1320 


ctgtccttct 


caccagaaag 


gacatctgtg 


1380 


gcctgtctca 


cctttcagga 


atgtgtcagc 


1440 


cgggactccc 


tctggctttc 


acaattgccc 


1500 


atgatgggaa 


agaaaatgac 


tgtgagcctg 


1560 


agctccagta 


cgatcccact 


ccgctgacat 


1620 


gcttcttgga 


ccgaggctgg 


gggttctgtc 


1680 


agtccaaggt 


cattgccccc 


ggagtgatct 


1740 
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atgatgttca 


ccaccagtgc 


cagctacaat 


atggacccaa 


tgctaccttc 


tgccaggaag 


1800 


tagaaaacgt 


ctgccagaca 


ctgtggtgct 


ccgtgaaggg 


cttttgtcgc 


tctaagctgg 


1860 


acgctgctgc 


agatggaact 


caatgtggtg 


agaagaagtg 


gtgtatggca 


ggcaagtgca 


1920 


tcacagtggg 


gaagaaacca 


gagagcattc 


ctggaggctg 


gggccgctgg 


tcaccctggt 


1980 


cccactgttc 


caggacctgt 


ggggctggag 


tccagagcgc 


agagaggctc 


tgcaacaacc 


2040 


ccgagccaaa 


gtttggaggg 


aaatattgca 


ctggagaaag 


aaaacgctat 


cgcttgtgca 


2100 


acgtccaccc 


ctgtcgctca 


gaggcaccaa 


catttcggca 


gatgcagtgc 


agtgaatttg 


2160 


acactgttcc 


ctacaagaat 


gaactctacc 


actggtttcc 


catttttaac 


ccagcacatc 


2220 


cttgtgagct 


ctactgccga 


cccatagatg 


gccagttttc 


tgagaaaatg 


ctggatgctg 


2280 


tcattgatgg 


taccccttgc 


tttgaaggcg 


gcaacagcag 


aaatgtctgt 


attaatggca 


2340 


tatgtaagat 


ggttggctgt 


gactatgaga 


tcgattccaa 


tgccaccgag 


gatcgctgcg 


2400 


gtgtgtgcct 


gggagatggc 


tcttcctgcc 


agactgtgag 


aaagatgttt 


aagcagaagg 


2460 


aaggatctgg 


ttatgttgac 


attgggctca 


ttccaaaagg 


agcaagggac 


ataagagtga 


2520 


tggaaattga 


gggagctgga 


aacttcctgg 


ccatcaggag 


tgaagatcct 


gaaaaatatt 


2580 


acctgaatgg 


agggtttatt 


atccagtgga 


acgggaacta 


taagctggca 


gggactgtct 


2640 


ttcagtatga 


caggaaagga 


gacctggaaa 


agctgatggc 


cacaggtccc 


accaatgagt 


2700 


ctgtgtggat 


ccagcttcta 


ttccaggtga 


ctaaccctgg 


catcaagtat 


gagtacacaa 


2760 


tccagaaaga 


tggccttgac 


aatgatgttg 


agcagatgta 


cttctggcag 


tacggccact 


2820 


ggacagagtg 


cagtgtgacc 


tgcgggacag 


gtatccgccg 


ccaaactgcc 


cattgcataa 


2880 


agaagggccg 


cgggatggtg 


aaagctacat 


tctgtgaccc 


agaaacacag 


cccaatggga 


2940 


gacagaagaa 


gtgccatgaa 


aaggcttgtc 


cacccaggtg 


gtgggcaggg 


gagtgggaag 


3000 


catgctcggc 


gacatgcggg 


ccccacgggg 


agaagaagcg 


aaccgtgctg 


tgcatccaga 


3060 


ccatggtctc 


tgacgagcag 


gctctcccgc 


ccacagactg 


ccagcacctg 


ctgaagccca 


3120 


agaccctcct 


ttcctgcaac 


agagacatcc 


tgtgcccctc 


ggactggaca 


gtgggcaact 


3180 


ggagtgagtg 


ttctgtttcc 


tgtggtggtg 


gagtgcggat 


tcgcagtgtc 


acatgtgcca 


3240 


agaaccatga 


tgaaccttgc 


gatgtgacaa 


ggaaacccaa 


cagccgagct 


ctgtgtggcc 


3300 


tccagcaatg 


cccttctagc 


cggagagttc 


tgaaaccaaa 


caaaggcact 


atttccaatg 


3360 


gaaaaaaccc 


accaacacta 


aagcccgtcc 


ctccacctac 


atccaggccc 


agaatgctga 


3420 


ccacacccac 


agggcctgag 


tctatgagca 


caagcactcc 


agcaatcagc 


agccctagtc 


3480 


ctaccacagc 


ctccaaagaa 


ggagacctgg 


gtgggaaaca 


gtggcaagat 


agctcaaccc 


3540 


aacctgagct 


gagctctcgc 


tatctcattt 


ccactggaag 


cacttcccag 


cccatcctca 


3600 
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cttcccaatc 


cttgagcatt 


cagccaagtg 


aggaaaatgt 


ttccagttca 


gatactggtc 


3660 


ctacctcgga 


gggaggcctt 


gtagctacaa 


caacaagtgg 


ttctggcttg 


tcatcttccc 


3720 


gcaaccctat 


cacttggcct 


gtgactccat 


tttacaatac 


cttgaccaaa 


ggtccagaaa 


3780 


tggagattca 


cagtggctca 


ggggaagaaa 


gagaacagcc 


tgaggacaaa 


gatgaaagca 


3840 


atcctgtaat 


atggaccaag 


atcagagtac 


ctggaaatga 


cgctccagtg 


gaaagtacag 


3900 


aaatgccact 


tgcacctcca 


ctaacaccag 


atctcagcag 


ggagtcctgg 


tggccaccct 


3960 


tcagcacagt 


aatggaagga 


ctgctcccca 


gccaaaggcc 


cactacttcc 


gaaactggga 


4020 


cacccagagt 


tgaggggatg 


gttactgaaa 


agccagccaa 


cactctgctc 


cctctgggag 


4080 


gagaccacca 


gccagaaccc 


tcaggaaaga 


cggcaaaccg 


taaccacctg 


aaacttccaa 


4140 


acaacatgaa 


ccaaacaaaa 


agttctgaac 


cagtcctgac 


tgaggaggat 


gcaacaagtc 


4200 


tgattactga 


gggctttttg 


ctaaatgcct 


ccaattacaa 


gcagctcaca 


aacggccacg 


4260 


gctctgcaca 


ctggatcgtc 


ggaaactgga 


gcgagtgctc 


caccacatgt 


ggcctggggg 


4320 


cctactggaa 


aagggtggag 


tgcaccaccc 


agatggattc 


tgactgtgcg 


gccatccaga 


4380 


gacctgaccc 


tgcaaaaaga 


tgccacctcc 


gtccctgtgc 


tggctggaaa 


gtgggaaact 


4440 


ggagcaagtg 


ctccagaaac 


tgcagtgggg 


gcttcaagat 


acgcgagatt 


cagtgcgtgg 


4500 


acagccggga 


ccaccggaac 


ctgaggccat 


ttcactgcca 


gttcctggcc 


ggcattcctc 


4560 


ccccattgag 


catgagctgt 


aacccggagc 


cctgtgaggc 


gtggcaggtg 


gagccttgga 


4620 


gccagtgctc 


caggtcctgt 


ggaggtggag 


ttcaggagag 


aggagtgttc 


tgtccagqaq 


4680 


gcctctgtga 


ttggacaaaa 


agacccacat 


ccaccatgtc 


ttgcaatgag 


cacctgtgct 


4740 


gtcactgggc 


cactgggaac 


tgggacctgt 


gttccacttc 


ctgtggaggt 


ggctttcaga 


4800 


agaggattgt 


ccaatgtgtg 


ccctcagagg 


gcaataaaac 


tgaagaccaa 


gaccaatgtc 


4860 


tatgtgatca 


caaacccaga 


cctccagaat 


tcaaaaaatg 


caaccagcag 


gcctgcaaga 


4920 


aaagtgccga 


tttactttgc 


actaaggaca 


aactgtcagc 


cagtttctgc 


cagacactga 


4980 


aagccatgaa 


gaaatgttct 


gtgcccaccg 


tgagggctga 


gtgctgcttc 


tcgtgtcccc 


5040 


agacacacat 


cacacacacc 


caaaggcaaa 


gaaggcaacg 


gttgctccaa 


aagtcaaaag 


5100 


aactctaagc 


ccaaa 










5115 



<210> 32 

<211> 799 

<212> DNA 

<213> Homo Sapiens 



<400> 32 

cactcccaaa gaactgggta ctcaacactg agcagatctg ttctttgagc taaaaaccat 



60 
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gtgctgtacc 


aagagtttgc 


tcctggctgc 


tttgatgtca gtgctgctac tccacctctg 


120 


cggcgaatca 


gaagcagcaa 


gcaactttga 


ctgctgtctt ggatacacag accgtattct 


180 


tcatcctaaa 


tttattgtgg 


gcttcacacg 


gcagctggcc aatgaaggct gtgacatcaa 


240 


tgctatcatc 


tttcacacaa 


agaaaaagtt 


gtctgtgtgc gcaaatccaa aacagacttg 


300 


ggtgaaatat 


attgtgcgtc 


tcctcagtaa 


aaaagtcaag aacatgtaaa aactgtggct 


360 


tttctggaat 


ggaattggac 


atagcccaag 


aacagaaaga accttgctgg ggttggaggt 


420 


ttcacttgca 


catcatggag 


ggtttagtgc 


ttatctaatt tgtgcctcac tggacttgtc 


480 


caattaatga 


agttgattca 


tattgcatca 


tagtttgctt tgtttaagca tcacattaaa 


540 




aCU t uatyCt 


+- 4- 4- -> 4- i~> 4- 

actxacagct 


gtaggttttc tgtgtttagc tatttaatac 


600 


taattttcca 


taagctattt 


tggtttagtg 


caaagtataa aattatattt gggggggaat 


660 


aagattatat 


ggactttctt 


gcaagcaaca 


agctattttt taaaaaaact atttaacatt 


720 


cttttgttta 


tattgttttg 


tctcctaaat 


tgttgtaatt gcattataaa ataagaaaaa 


780 


cattaataag 


acaaatatt 






799 



<210> 33 

<211> 1901 

<212> DNA 

<213> Homo Sapiens . 

<400> 33 



gggcaggaag 


acggcgctgc 


ccggaggagc 


ggggcgggcg ggcgcgcggg ggagcgggcg 


60 


gcgggcggga 


gccaggcccg 


ggcgggggcg 


ggggcggcgg ggccagaaga ggcggcgggc 


120 


cgcgctccgg 


ccggtctgcg 


gcgttggcct 


tggctttggc tttggcggcg gcggtggaga 


180 


agatgctgca 


gtccctggcc 


ggcagctcgt 


gcgtgcgcct ggtggagcgg caccgctcgg 


240 


cctggtgctt 


cggcttcctg 


gtgctgggct 


acttgctcta cctggtcttc ggcgcagtgg 


300 


tcttctcctc 


ggtggagctg 


ccctatgagg 


acctgctgcg ccaggagctg cgcaagctga 


360 


agcgacgctt 


cttggaggag 


cacgagtgcc 


tgtctgagca gcagctggag cagttcctgg 


420 


gccgggtgct 


ggaggccagc 


aactacggcg 


tgtcggtgct cagcaacgcc tcgggcaact 


480 


ggaactggga 


cttcacctcc 


gcgctcttct 


tcgccagcac cgtgctctcc accacaggtt 


540 


atggccacac 


cgtgcccttg 


tcagatggag 


gtaaggcctt ctgcatcatc tactccgtca 


600 


ttggcattcc 


cttcaccctc 


ctgttcctga 


cggctgtggt ccagcgcatc accgtgcacg 


660 


tcacccgcag 


gccggtcctc 


tacttccaca 


tccgctgggg cttctccaag caggtggtgg 


720 


ccatcgtcca 


tgccgtgctc 


cttgggtttg 


tcactgtgtc ctgcttcttc ttcatcccgg 


780 


ccgctgtctt 


ctcagtcctg 


gaggatgact 


ggaacttcct ggaatccttt tatttttgtt 


840 
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ttatttccct 


gagcaccatt 


ggcctggggg 


attatgtgcc 


tggggaaggc tacaatcaaa 


900 


aattcagaga 


gctctataag 


attgggatca 


cgtgttacct 


gctacttggc cttattgcca 


960 


tgttggtagt 


tctggaaacc 


ttctgtgaac 


tccatgagct 


gaaaaaattc agaaaaatgt 


1020 


tctatgtgaa 


gaaggacaag gacgaggatc 


aggtgcacat 


catagagcat gaccaactgt 


1080 


ccttctcctc 


gatcacagac 


caggcagctg 


gcatgaaaga 


ggaccagaag caaaatgagc 


1140 


cttttgtggc 


cacccagtca 


tctgcctgcg 


tggatggccc 


tgcaaaccat tgagcgtagg 


1200 


atttgttgca 


ttatgctaga 


gcaccagggt 


cagggtgcaa 


ggaagaggct taagtatgtt 


1260 


catttttatc 


agaatgcaaa 


agcgaaaatt 


atgtcacttt 


aagaaatagc tactgtttgc 


1320 


aatgtcttat 


taaaaaacaa 


caaaaaaaga 


cacatggaac 


aaagaagctg tgaccccagc 


1380 


aggatgtcta 


atatgtgagg 


aaatgagatg 


tccacctaaa 


attcatatgt gacaaaatta 


1440 


tctcgacctt 


acataggagg 


agaatacttg 


aagcagtatg 


ctgctgtggt tagaagcaga 


1500 


ttttatactt 


ttaactggaa 


actttggggt 


ttgcatttag 


atcatttagc tgatggctaa 


1560 


atagcaaaat 


ttatatttag 


aagcaaaaaa 


aaaaagcata 


gagatgtgtt ttataaatag 


1620 


gtttatgtgt. 


actggtttgc 


atgtacccac 


ccaaaatgat 


tatttttgga gaatctaagt 


1680 


caaactcact 


atttataatg cataggtaac 


cattaactat 


gtacatataa agtataaata 


1740 


tgtttatatt 


ctgtacatat 


ggtttaggtc 


accagatcct 


agtgtagttc tgaaactaag 


1800 


actatagata 


ttttgtttct 


tttgatttct 


ctttatacta 


aagaatccag agttgctaca 


1860 


ataaaataag 


gggaataata 


aacttgagag 


tgaataacca 


t 


1901 


<210> 34 

<211> 492 

<212> DNA 

<213> Homo Sapiens 










<400> 34 
aaagggactc 


cttgaaactg 


attgagagcc 


cagtggattt 


gccagcagtt tgagcttcta 


60 


ccgagtcttc 


ccccacctca 


atccctgttg 


ctatggagac 


taccaatgga acggagacct 


120 


ggtatgagag 


cctgcatgcc 


gtgctgaagg 


ctctaaatgc 


cactcttcac agcaatttgc 


180 


tctgccggcc 


agggccaggg 


ctggggccag 


acaaccagac 


tgaagagagg cgggccagcc 


240 


tacctggccg tgatgacaac 


tcctacatgt 


acattctctt 


tgtcatgttt ctatttgctg 


300 


taactgtggg cagcctcatc 


ctgggataca 


cccgctcccg 


caaagfcggac aagcgtagtg 


360 


acccctatca tgtgtatatc 


aagaaccgtg 


tgtctatgat 


ctaacacgag agggctggga 


420 


c 99tggaaga 


ccaagacacc 


tggggattgc 


gtctggggcc 


tccagaactc tgctgtggac 


480 


tgcatcaggt 


ct 








492 
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<210> 35 

.<2ll> 14756 

<212> DNA 

<213> Homo Sapiens 

<400> 35 



ctgggcggcc 


gggcgcgggg agagggcgcg 


ggagcggctc gtgcggcagg 


taccatgcgg 


60 


acgcgcgagc 


ccggcgaggc cccggcaggc 


ccgtccctgc tcgggggcgc 


gctgagacgg 


120 


cgggtgagct 


ccacgagagc gccgtcgcca 


cttcgggcca actttgcgat 


tcccgacagt 


180 


taagcaatgg 


ggagacattt ggctttgctc 


ctgcttctgc tccttctctt 


ccaacatttt 


240 


ggagacagtg 


atggcagcca acgacttgaa 


cagactcctc tgcagtttac 


acacctcgag 


300 


tacaacgtca 


ccgtgcagga gaactctgca 


gctaagactt atgtggggca 


tcctgtcaag 


360 


atgggtgttt 


acattacaca tccagcgtgg 


gaagtaaggt acaaaattgt 


ttccggagac 


420 


agtgaaaacc 


tgttcaaagc tgaagagtac 


attctcggag acttttgctt 


tctaagaata 


480 


aggaccaaag 


gaggaaatac agctattctt 


aatagagaag tgaaggatca 


ctacacattg 


540 


atagtgaaag 


cacttgaaaa aaatactaat 


gtggaggcgc gaacaaaggt 


cagggtgcag 


600 


gtgctggata 


caaatgactt gagaccgtta 


ttctcaccca cctcatacag 


cgtttcttta 


660 


cctgaaaaca 


cagctataag gaccagtatc 


gcaagagtca gcgccacgga 


tgcagacata 


720 


ggaaccaacg 


gggaatttta ctacagtttt 


aaagatcgaa cagatatgtt 


tgctattcac 


780 


ccaaccagtg 


gtgtgatagt gttaactggt 


agacttgatt acctagagac 


caagctctat 


840 


gagatggaaa 


tcctcgctgc ggaccgtggc 


atgaagttgt atgggagcag 


tggcatcagc 


900 


agcatggcca 


agctaacggt gcacatcgaa 


caggccaatg aatgtgctcc 


ggtgataaca 


960 


gcagtgacat 


tgtcaccatc agaactggac 


agggacccag catatgcaat 


tgtgacagtg 


1020 


gatgactgcg 


atcagggtgc caatggtgac 


atagcatctt taagcatcgt 


ggcaggtgac 


1080 


cttctccagc 


agtttagaac agtgaggtcc 


tttccaggga gtaaggagta 


taaagtcaaa 


1140 


gccatcggtg 


acattgattg ggacagtcat 


cctttcggct acaatctcac 


actacaggct 


1200 


aaagataaag 


gaactccgcc ccagttctct 


tctgttaaag tcattcacgt 


gacttctcca 


1260 


cagttcaaag 


ccgggccagt caagtttgaa 


aaggatgttt acagagcaga 


aafcaagtgaa 


1320 


tttgctcctc 


ccaacacacc tgtggtcatg 


gtaaaggcca ttcctgctta 


ttcccatttg 


1380 


aggtatgttt 


ttaaaaggac acctggaaaa 


gctaaattca gttfaaatta 


caacactggt 


1440 


ctcatttcta 


ttttagaacc agttaaaaga 


cagcaggcag cccattttga 


acttgaagta 


1500 


acaacaagtg 


acagaaaagc gtccaccaag 


gtcttggtga aagtcttagg 


tgcaaatagc 


1560 


aatccccctg 


aatttaccca gacagcgtac 


aaagctgctt ttgatgagaa 


cgtgcccatt 


1620 
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ggtactacta 


tcatgagcct 


gagtgccgta 


gaccctgatg 


agggtgagaa 


tgggtacgtg 


1680 


acatacagta 


tcgcaaattt 


aaatcatgtg 


ccgtttgcga 


ttgaccattt 


cactggtgcc 


1740 


gtgagtacgt 


cagaaaacct 


ggactacgaa 


ctgatgcctc 


gggtttatac 


tctgaggatt 


18O0 


cgtgcatcag 


actggggctt 


gccgtaccgc 


cgggaagtcg 


aagtccttgc 


tacaattact 


1860 


ctcaataact 


tgaatgacaa 


cacacctttg 


tttgagaaaa 


taaattgtga 


agggacaatt 


1920 


cccagagatc 


taggcgtggg 


agagcaaata 


accactgttt 


ctgctattga 


tgcagatgaa 


1980 


cttcagttgg 


tacagtatca 


gattgaagct 


ggaaatgaac 


tggatttgtt 


tagtttaaac 


2040 


cccaactcgg 


gggtattgtc 


attaaagcga 


tcgctaatgg 


atggcttagg 


tqcaaaqqta 


2100 


tctttccaca 


gtctgagaat 


cacagctaca 


gatggagaaa 


attttgccac 


accattatat 


2160 


atcaacataa 


cagtggctgc 


cagtcacaag 


ctggtaaact 


tgcagtgtga 


agagactgqt 


2220 


gttgccaaaa 


tgctggcaga 


gaagctcctg 


caggcaaata 


aattacacaa 


ccagggagag 


2280 


gtggaggata 


ttttcttcga 


ttctcactct 


gtcaatgctc 


acataccgca 


gtttagaagc 


2340 


actcttccga 


ctggtattca 


ggtaaaggaa 


aaccagcctg 


tgggttccag 


tgtaattttc 


2400 


atgaactcca 


ctgaccttga 


cactggcttc 


aatggaaaac 


tggtctatgc 


tgtttctgga 


2460 


ggaaatgagg 


atagttgctt 


catgattgat 


atggaaacag 


gaatgctgaa 


aattttatct 


2520 


cctcttgacc 


gtgaaacaac 


agacaaatac 


accctgaata 


ttaccgtcta 


tqaccttqqq 


2580 


ataccccaga 


aggctgcgtg 


gcgtctfccta 


catgtcgtgg 


ttgtcgatgc 


caatgataat 


2640 


ccacccgagt 


ttttacagga 


gagctatttt 


qtqqaaqtqa 


gtgaagacaa 


qqaQqtacat 


2700 


agtgaaatca 


tccaggttga 


agccacagat 


aaagacctgg 


qqcCCaaCQQ 


acacgtgacg 


2760 


tactcaattc 


ttacagacac 


agacacattt 


tcaattgaca 


qcqtqacqqq 


tgttgttaac 


2820 


atcgcacgcc 


ctctggatcg 


agagctgcag 


catgagcact 


ccttaaagat 


tqaqqccaqq 


2880 


gaccaagcca 


gagaagagcc 


tcagctgttc 


tccactgtcg 


ttgtgaaagt 


atcactagaa 


2940 


gatgttaatg 


acaacccacc 


tacatttatt 


ccacctaatt 


atcgtgtgaa 


agtccgagag 


3000 


gatcttccag 


aaggaaccgt 


catcatgtgg 


ttagaagccc 


acgatcctga 


tttaggtcag 


3060 


tctggtcagg 


tgagatacag 


ccttctggac 


cacggagaag 


gaaacttcga 


tgtggataaa 


3120 


cucagnggag 


cagttaggat 


cgtccagcag 


ttggaccttg 


agaagaagca 


agtgtataat 


3180 


ctcactgtga 


gggccaaaga 


caagggaaag 


ccagtttctc 


tgtcttctac 


ttgctatgtt 


3240 


gaagttgagg 


tggttgatgt 


gaatgagaac 


ctgcacccac 


ccgtgttttc 


cagctttgtg 


3300 


gaaaagggga 


cagtgaaaga 


agatgcacct 


gttggttcat 


tggtaatgac 


ggtgtcggct 


3360 


catgatgagg 


acgccggaag 


agatggggag 


atccgatact 


ccattagaga 


tggctctggc 


3420 



WO 2004/U73657 PCT/US2004/005455 

58/282 



gttggtgttt 


tcaaaatagg 


tgaagagaca 


ggtgtcatag 


agacgtcaga 


tcgactggac 


3480 


cgtgaatcga 


cctcccatta 


ttggctaaca 


gtctttgcaa 


ccgatcaggg 


tgtcgtgcct 


3540 


ctttcatcgt 


tcatagagat 


ctacatagag 


gttgaggatg 


tcaatgacaa 


tgcaccacag 


3600 


acatcagagc 


ctgtttatta 


cccagaaatc 


atggaaaatt 


ctcctaaaga 


tgtatctgtg 


3660 


gtccagatcg 


aggcatttga 


tccagattcg 


agctctaatg 


acaagctcat 


gtacaaaatt 


3720 


acaagtggaa 


atccacaagg 


attcttttca 


atacatccta 


aaacaggtct 


catcacaact 


3780 


acgtcaagga 


agctagaccg 


agaacagcaa 


gatgaacaca 


tattagaggt 


tactgtgaca 


3840 


gacaatggta 


gtccccccaa 


atcaaccatt 


gcaagagtca 


ttgtgaaaat 


ccttgatgaa 


3900 


aatgacaaca 


aacctcagtt 


tctgcaaaag 


ttctacaaaa 


tcagactccc 


tgagcgggaa 


3960 


aagccagacc 


gagaaagaaa 


tgccagacgg 


gagccgctct 


atcgcgtcat 


agccaccgac 


4020 


aaggatgagg 


gccccaatgc 


agaaatctcc 


tacagcatcg 


aagacgggaa 


tgagcatggc 


4080 


aaatttttca 


tcgaaccgaa 


aactggagtg 


gtttcgtcca 


agaggttttc 


agcagctgga 


4140 


gaatatgata 


ttctttcaat 


taaggcagtt 


gacaatggtc 


gccctcaaaa 


gtcatcaacc 


4200 


accagactcc 


atattgaatg 


gatctccaag 


cccaaacagt 


ccctggagcc 


catttcattt 


4260 


gaagaatcat 


tttttacctt 


tactgtgatg 


gaaagtgacc 


ccgttgctca 


ca tgattgga 


4320 


gtaatatctg 


tggagcctcc 


tggcataccc 


ctttggtttg 


acatcactgg 


tggcaactac 


4380 


gacagtcact 


tcgatgtgga 


caagggaact 


ggaaccatca 


ttgttgccaa 


acctcttgat 


4440 


gcagaacaga 


agtcaaacta 


caacctcaca 


gtcgaggcta 


cagatggaac 


caccactatc 


4500 


ctcactcagg 


tattcatcaa 


agtaatagac 


acaaatgacc 


atcgtcctca 


gttttctaca 


4560 


tcaaagtatg 


aagttgttat 


tcctgaagat 


acagcgccag 


aaacagaaat 


tttgcaaatc 


4620 


agtgctgtgg 


atcaggatga 


gaaaaacaaa 


ctaatctaca 


ctctgcagag 


cagtagagat 


4680 


ccactgagtc 


tcaagaaatt 


tcgtcttgat 


cctgcaaccg 


gctctctcta 


tacttctgag 


4740 


aaactggatc 


atgaagctgt 


ttcaccagca 


cacctcacgg 


tcatggtacg 


agatcaagat 


4800 


gtgcctgtaa 


aacgcaactt 


tgcaaggatt 


gtggtcaatg 


tcagcgacac 


gaatgaccac 


4860 


gccccgtggt 


tcaccgcttc 


ctcctacaaa 


gggcgggttt 


atgaatcggc 


agccgttggc 


4920 


tcagttgtgt 


tgcaggtgac 


ggctctggac 


aaggacaaag 


ggaaaaatgc 


tgaagtgctg 


4980 


tactcgatcg 


agtcaggaaa 


tattggaaat 


attggaaatt 


cttttatgat 


tgatcctgtc 


5040 


ttgggctcta 


ttaaaactgc 


caaagaatta 


gatcgaagta 


accaagcgga 


gtatgattta 


5100 


atggtaaaag 


ctacagataa 


gggcagtcca 


ccaatgagtg 


aaataacttc 


tgtgcgtatc 


5160 


tttgtcacaa 


ttgctgacaa 


cgcctctccg 


aagtttacat 


caaaagaata 


ttctgttgaa 


5220 


cttagtgaaa 


ctgtcagcat 


tgggagtttc 


gttgggatgg 


ttacagccca 


tagtcaatca 


5280 
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tcagtggtgt atgaaataaa agatggaaat acaggtgatg cttttgatat taatccacat 5340 

tctggaacta tcatcactca gaaagccctg gactttgaaa ctttgcccat ttacacattg 5400 

ataatacaag gaactaacat ggctggtttg tccactaata caacggttct agttcacttg 5460 

caggatgaga atgacaacgc gccagttttt atgcaggcag aatatacagg actcattagt 5520 

gaatcagcct caattaacag cgtggtccta acagacagga atgtcccact ggtgattcga 5580 

gcagctgatg ctgataaaga ctcaaatgct ttgcttgtat atcacattgt tgaaccatct 564 0 

gtacacacat attttgctat tgattctagc actggtgcta ttcatacagt actaagtctg 5700 

gactatgaag aaacaagtat ttttcacttt accgtccaag tgcatgacat gggaacccca 5760 

cgtttatttg ctgagtatgc agcgaatgta acagtacatg taattgacat taatgactgc 5820 

ccccctgtgt ttgccaagcc attatatgaa gcatctcttfc tgttaccaac atacaaagga 5880 

gtaaaagtca tcacagtaaa tgctacagat gctgattcaa gtgcattctc acagttgatt 5940 

tactccatca ccgaaggcaa catcggggag aagttttcta tggactacaa gactggtgct 6000 

ctcactgtcc aaaacacaac tcagttaaga agccgctacg agctaaccgt tagagcttcc 6060 

gatggcagat ttgccggcct tacctctgtc aaaattaatg tgaaagaaag caaagaaagt 6120 

cacctaaagt ttacccagga tgtctactct gcggtagtga aagagaattc caccgaggcc 6180 

gaaacattag ctgtcattac tgctattggg agtccaatca atgagccttt gttttatcac 624 0 

atcctcaacc cagatcgcag atttaaaata agccgcactt caggggttct gtcaaccact 6300 

ggcacgccct tcgatcgtga gcagcaggag gcgtttgatg tggttgtaga agtgatagag 6360 

gaacataagc cttctgcagt ggcccacgtt gtcgtgaagg tcattgtaga agaccaaaat 6420 

gataatgcgc cggtgtttgt caaccttccc tactacgccg ttgttaaagt ggacactgag 6480 

gtgggccatg tcattcgcta tgtcactgct gtagacagag acagtggcag aaacggggaa 6540 

gtgcattact acctcaagga acatcatgaa cactttcaaa ttggaccctt gggtgaaatt 6600 

tcactgaaaa agcaatttga gcttgacacc ttaaataaag aatatcttgfc tacagtggtt 6660 

gcaaaagatg gagggaaccc ggccttttca gcggaagtta tcgttccgat cactgtcatg 6720 

aataaagcca tgcctgtgtt tgaaaaacct ttctacagtg cagagattgc agagagcatc 6780 

caggtgcaca gccctgtggt ccacgtgcag gctaacagcc cggaaggcct gaaagtgttc 6840 

tacagcatca cagacggaga ccctttcagc cagttcacta ttaacttcaa tactggagtt 6900 

atcaatgtca tagctcctct ggactttgag gcccacccgg catataagct gagcatacgc 6960 

gcaactgact ccttgacggg cgctcatgct gaagtatttg tggacatcat agtagacgac 7020 

atcaatgata accctcctgt gtttgctcag cagtcttatg cggtgaccct gtctgaggca 7080 
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tctgtaattg gaacgtctgt tgttcaagtt agagccaccg attctgattc agaaccaaat 7140 

agaggaatct cataccagat gtttgggaat cacagcaaga gtcatgatca ttttcatgta 7200 

gacagcagca ctggcctcat ctcactactc agaaccctgg attacgagca gtcccggcag 7260 

cacacgattt ttgtgagggc agttgatggt ggtatgccca cgctgagcag tgatgtgatt 7320 

gtcacggtgg acgttaccga cctoaatggt aatccaccac tctttgaaca acagatttat 7380 

gaagccagaa ttagcgagca cgcccctcat gggcatttcg tgacctgtgt aaaagcctat 7440 

gatgcagaca gttcagacat agacaagttg cagtattcca ttctgtctgg caatgatcat 7500 

aaacattttg tcattgacag tgcaacaggg attatcaccc tctcaaacct gcaccggcac 7560 

gccctgaagc cattttacag tcttaacctg tcagtgtctg atggagtttt tagaagttcc 7620 

acccaggttc atgtaactgt aattggaggc aatttgcaca gtcctgcttt ccttcagaac 7680 

gaatatgaag tggaactagc tgaaaacgct cccctacata ccctggtgat ggaggtgaaa 7740 

actacggatg gggattctgg tatttatggt cacgttactt accatattgt aaatgacttt 7800 

gccaaagaca gattttacat aaatgagaga ggacagatat ttactttgga aaaacttgat 7860 

cgagaaaccc cggcggagaa agtgatctca gtccgtttaa tggctaagga tgctggagga 7920 

aaagttgctt tctgcaccgt gaatgtcatc cttacagatg acaatgacaa tgcaccacaa 7980 

tttcgagcaa ccaaatacga agtgaatatc gggtccagtg ctgctaaagg gacttcagtc 8040 

gtaaagtctg caagtgatgc cgatgagggc tccaatgccg acatcaccta tgccafctgaa 8100 

gcagactctg aaagtgtaaa agagaatttg gaaattaaca aactgtccgg cgtaatcact 8160 

acaaaggaga goctcattgg cttggaaaat gaattcttca ctttctttgt tagagctgtg 8220 

gataatgggt ctccatcaaa agaatctgtt gttcttgtct atgttaaaat ccttccaccg 8280 

gaaatgcagc ttccaaaatt ttcagaacct tfcctatacct ttacagtgtc agaggacgtg 8340 

cctgttggaa cagagataga tctcatccga gcagaacata gtgggactgt tctttacagc 8400 

ctggtcaaag ggaatactcc agaaagcaat agggatgagt cctttgtgat tgacagacag 8460 

agcgggagac tgaagttgga gaagagtctt gatcatgaga caactaagtg gtatcagttt 8520 

tccatactgg ccaggtgcac tcaagatgac catgagatgg tggcttctgt agatgttagt 8580 

atccaagtga aagatgcaaa tgacaacagc ccggtctttg aatctagtcc atatgaggca 8640 

ttcattgttg aaaacctgcc agggggaagt agagtaattc agatcagggc atctgatgct 8700 

gactoaggaa ccaacggcca agttatgtat agcctggatc agtcacaaag tgtggaagtc 8760 

attgaatcct ttgccattaa catggaaaca ggctggatta caactttaaa ggaacttgac 8820 

catgaaaaga gagacaatta ccagattaaa gtggttgcat cagatcatgg tgaaaagatc 8880 

oagctatcot ccacagccat tgtggatgtt accgtcaccg atgtcaacga tagtccacca 8940 
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cgattcacgg 


ccgagatcta 


taaagggact gtgagtgagg atgaccccca aggtggggtg 


9000 


attgccatct 


taagtaccac 


ggatgctgat tctgaagaga tcaacagaca agttacatat 


9060 


ttcataacag 


gaggggatcc 


tttaggacag tttgccgttg aaactataca gaatgaatgg 


9120 


aaggtatatg 


tgaagaaacc 


tctagacagg gaaaaaaggg acaattacct tcttactatc 


9180 


acggcaactg 


atggcacctt 


ctcatcaaaa gcgatagttg aagtgaaagt tctggatgca 


9240 


aatgacaaca 


gtccagtttg 


tgaaaagact ttatattcag acactattcc tgaagacgtc 


9300 


cttcctggaa 


aattgatcat 


gcagatctct gctacagacg cagacatccg ctctaacgct 


9360 


gaaattactt 


acacgttatt 


gggttcaggt gcagaaaaat tcaaactaaa tccagacaca 


9420 


ggtgaactga 


aaacgtcaac 


cccccttgat cgtgaggagc aagctgttta tcatcttctc 


9480 


gtcagggcca 


cagatggagg 


aggaagattc tgccaagcca gtattgtcgt cacgctagaa 


9540 


gatgtgaacg 


ataacgcccc 


cgaattctct gccgatcctt atgccatcac cgtgtttgaa 


9600 


aacacagagc 


cgggaacgct 


gctgacaaga gtgcaggcca cagatgccga cgcaggatta 


9660 


aatcggaaga 


ttttatactc 


actgattgac tctgctgatg ggcagttctc cattaacgaa 


9720 


ttatctggaa 


ttattcagtt 


agaaaaacct ttggacagag aactccaggc agtatacacc 


9780 


ctctctttga 


aagctgtgga 


tcaaggcttg ccaaggaggc tgactgccac tggcactgtg 


9840 


attgtatcag 


ttcttgacat 


aaatgacaac ccccctgtgt ttgagtaccg tgaatatggt 


9900 


gccaccgtgt 


etgaggacat 


tcttgttgga actgaagttc ttcaagtgta tgcagcaagt 


9960 


cgggatattg 


aagcaaatgc 


agaaatcacc tactcaataa taagtggaaa tgaacatggg 


10020 


aaattcagca 


tagattctaa 


aacaggggcc gtatttatca ttgagaatct ggattatgag 


10080 


agctctcatg 


agtattacct 


aacagtagag gccactgatg gaggcacgcc ttcactgagc 


10140 


gacgttgcca 


ctgtgaacgt 


taatgtaaca gatatcaacg ataatacccc tgtgttcagc 


10200 


caagacacct 


acacgacagt 


catcagtgaa gatgccgttc ttgagcagtc tgtcatcacg 


10260 


gttatggccg 


atgatgccga 


tggaccttcc aacagccaca tccactactc aattatagat 


10320 


ggcaaccaag 


gaagctcgtt 


cacaattgac cccgtcaggg gagaagtcaa agtgaccaaa 


10380 


cttctcgacc 


gagaaacgat 


ttcaggttac acgctcacgg ttcaagcttc tgataatggc 


10440 




gagt-caacac 


gacgaccgtg aacatcgatg tgtccgatgt caatgacaac 


10500 


gcgcccgtct 


tctccagggg 


aaactacagt gtcattatcc aggaaaataa gccagtgggc 


10560 


ttcagcgtgc 


tgcagctggt 


agtaacagat gaggattctt cccataacgg tccacccttc 


10620 


ttctttacta 


ttgtaactgg 


aaatgatgag aaggcttttg aagttaaccc gcaaggagtc 


10680 


ctcctgacat 


catctgccat 


caagaggaag gagaaagatc attacttact gcaggtgaag 


10740 
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gtggcagata atggaaagcc tcagttgtca tctttgacat acattgacat tagggtaatt 10800 

gaggagagca tctatccgcc tgcgattttg cccctggaga ttttcatcac ctcttctgga 10860 

gaagaatact caggtggcgt cattgggaag atccatgcca cagaccagga cgtgtatgat 10920 

actctaacct acagtctcga ccctcagatg gacaacctgt tctctgtttc cagcacaggg -10980 

ggcaagctga tagcacacaa aaagctagac atagggcaat accttctcaa tgtcagcgta 11040 

acagatggga agttcacgac ggtggccgac atcacagtgc atatcagaca agtcacacag 11100 

gagatgttga accacaccat cgcgatccgc tttgccaacc tcactccgga agaattcgtt 11160 

ggtgactact ggcgcaactt ccagcgagct ttacggaaca tcctgggtgt gaggaggaac 11220 

gacatacaga ttgttagttt gcagtcctct gaacctcacc cacatctgga cgtcttactt 11280 

tttgtagaga aaccaggtag tgctcagatc tcaacaaaac aacttctgca caagattaac 11340 

tcttccgtga ctgacattga ggaaatcatt ggagttagga tactgaatgt attccagaaa 11400 

ctctgcgcgg gactggactg cccctggaag ttctgcgatg aaaaggtgtc tgtggatgaa 11460 

agtgtgatgt caacacacag cacagccaga ctgagttttg tgactccccg ccaccacagg 11520 

gcagcggtgt gtctctgcaa agagggaagg tgcccacctg tccaccatgg ctgtgaagat 11580 

gatccgtgcc ctgagggatc cgaatgtgtg tctgatccct gggaggagaa acacacctgt 11640 

gtctgtccca gcggcaggtt tggtcagtgc ccagggagtt catctatgac actgactgga 11700 

aacagctacg tgaaataccg tctgacggaa aatgaaaaca aattagagat gaaactgacc 11760 

atgaggctca gaacatattc cacgcatgcg gttgtcatgt atgctcgagg aactgactat 11820 

agcatcttgg agattcatca tggaaggctg cagtacaagt ttgactgtgg aagtggccct 11880 

ggaattgtct ctgttcagag cattcaggtc aatgatgggc agtggcacgc agtggccctg 1134 0 

gaagtgaatg gaaactatgc tcgcttggtt ctagaccaag ttcatactgc atcgggcaca 12000 

gccccaggga ctctgaaaac cctgaacctg gataactatg tgttttttgg tggccacatc 12060 

cgtcagcagg gaacaaggca tggaagaagt cctcaagttg gtaatggttt caggggttgt 12120 

atggactcca tttatttgaa tgggcaggag ctccctttaa acagcaaacc cagaagctat 12180 

gcacacatcg aagagtcggt ggatgtatct ccaggctgct tcctgacggc cacggaagac 12240 

tgcgccagca acccttgcca gaatggaggc gtttgcaatc cgtcacctgc tggaggttat 12300 

tactgcaaat gcagtgcctt gtacataggg acccactgtg agataagcgt caatccgtgt 12360 

tcctccaacc catgcctcta tgggggcacg tgtgttgtcg acaacggagg ctttgtttgc 12420 

cagtgtagag gattatatac tggtcagagg tgtcagctta gtccatactg caaagatgaa 12480 

ccctgtaaga atggcggaac atgctttgac agtttggatg gcgccgtttg tcagtgtgat 12540 

tcgggtttta ggggagaaag gtgtcagagt gatatcgacg agtgctctgg aaacccttgc 12600 
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ctgcacgggg ccctctgtga gaacacgcac ggctcctatc actgcaactg cagccacgag 12660 
tacaggggac gtcactgcga ggatgctgcg cccaaccagt atgtgtccac gccgtggaac 12720 
attgggttgg cggaaggaat tggaatcgtt gtgtttgttg cagggatatt tttactggtg 12780 
gtggtgtttg ttctctgccg taagatgatt agtcggaaaa agaagcatca ggctgaacct 12840 
aaagacaagc acctgggacc cgctacggct ttcttgcaaa gaccgtattt tgattccaag 12900 
ctaaataaga acatttactc agacatacca ccccaggtgc ctgtccggcc tatttcctac 12960 
accccgagta ttccaagtga ctcaagaaac aatctggacc gaaattcctt cgaaggatct 13020 
gctatcccag agcatcccga attcagcact tttaaccccg agtctgtgca cgggcaccga 13 080 
aaagcagtgg cggtctgcag cgtggcgcca aacctgcctc ccccaccccc ttcaaactcc 13140 
ccttctgaca gcgactccat ccagaagcct agctgggact ttgactatga cacaaaagtg 13200 
gtggatcttg atccctgtct ttccaagaag cctctagagg aaaagccttc ccagccatac 13260 
agtgcccggg aaagcctgtc tgaagtgcag tccctgagct ccttccagtc cgaatcgtgc 13320 
gatgacaatg ggtatcactg ggatacatca gattggatgc caagcgttcc tctgccggac 13380 
atacaagagt tccccaacta tgaggtgatt gatgagcaga cacccctgta ctcagcagat 13440 
ccaaacgcca tcgatacgga ctattaccct ggaggctacg acatcgaaag tgattttcct 13500 
ccacccccag aagacttccc cgcagctgat gagctaccac cgttaccgcc cgaattcagc 13560 
aatcagtttg aatccatcca ccctcctaga gacatgcctg ccgcgggtag cttgggttct 13620 
tcatcaagaa accggcagag gttcaacttg aatcagtatt tgcccaattt ttatcccctc ' 13680 
gatatgtctg aacctcaaac aaaaggcact ggtgagaata gtacttgtag agaaccccat 13 740 
gccccttacc cgccagggta tcaaagacac ttcgaggcgc ccgctgtcga gagcatgccc 13800 
atgtctgtgt acgcctccac cgcctcctgc tctgacgtgt cagcctgctg cgaagtggag 13860 
tccgaggtca tgatgagtga ctatgagagc ggggacgacg gccacttcga agaggtgacg 13920 
atcccgcccc tggattccca gcagcacacg gaagtctgac tctcaacfccc ccccaaagtg 13980 
cctgacttta gtgaacctag aggtgatgtg agtaatccgc gctgttcttt gcagcagtgc 14040 
ttccaagctt tttttggtga gccgaatggg catggctgcg ctggatcctg cgcctctgga 14100 
cgtgctagcc atttccagtg tcccaactac tgtcatcgtg aggttttcat cggctgtgcc 14160 
atttcccaac gtcttttggg atttacatct gtctgtgtta aaataatcaa acgaaaaatc 14220 
agtcctgtgt tgtcagcatg attcatgtat ttatatagat ttgattattt taattttcct 14280 
gtctcttttt tttgtaaatt ttatgtacag atttgatttt tcatagtttt aactagattt 14340 
ccaagatatt ttgtgcattt gtttcaactg aattttggtg gtgtcagtgc cattatctag 14400 
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caccctgatt tttttttttt tactataacc agggtttcat tctgtctttt tccactgaag 14460 

tgtgacattt tgttagtaca tttcagtgta gtcattcatt tctagctgta cataggatga 14S20 

aggagagatc agatacatga acatgtctta catgggttgc tgtatttaga attataaaca 24580 

fcttttcatta ttggaaagtg taacggggac cttctgcata cctgtttaga accaaaacca 14640 

ccatgacaca gtttttatag tgtctgtata tttgtgatgc aatggtcttg taaaggtttt 14 700 

taatgaaaac taccattagc cagtctttct tactgacaat aaattattaa taaaat 14756 

<210> 36 

<211> 1583 

<212> DNA 

<213> Homo Sapiens 

<400> 36 

gtggtgtttg ctttctccac cagaagggca cactttcatc taatttgggg tatcactgag 60 

ctgaagacaa agagaagggg gagaaaacct agcagaccac catgtgctat gggaagtgtg 120 

cacgatgcat cggacattct ctggtggggc tcgccctcct gtgcatcgcg gctaatattt 180 

tgctttactt tcccaatggg gaaacaaagt atgcctccga aaaccacctc agccgcttcg 240 

tgtggttctt ttctggcatc gtaggaggtg gcctgctgat gctcctgcca gcatttgtct 300 

tcattgggct ggaacaggat gactgctgtg gctgctgtgg ccatgaaaac tgtggcaaac 360 

gatgtgcgat gctttcttct gtattggctg ctctcattgg aattgcagga tctggctact 420 

gtgtcattgt ggcagccctt ggcttagcag aaggaccact atgtcttgat tccctcggcc 480 

agtggaacta cacctttgcc agcaccgagg gccagtacct tctggatacc tccacatggt 540 

ccgagtgcac tgaacccaag cacattgtgg aatggaatgt atctctgttt tctatcctct 600 

tggctcttgg tggaattgaa ttcatcttgt gtcttattca agtaataaat ggagtgcttg 660 

gaggcatatg tggcttttgc tgctctcacc aacagcaata tgactgctaa aagaaccaac 720 

ccaggacaga gccacaatct tcctctattt cattgtaatt tatatatttc acttgtattc 780 

atttgtaaaa ctttgtatta gtgtaacata ctccccacag tctactttta caaacgcctg 840 

taaagactgg catcttcaca ggatgtcagt gtttaaattt agtaaacttc? ttttttgttt 900 

gtttatttgt ttttgttttt ttttaaggaa tgaggaaaca aaccaccctc tgggggtagt 960 

ttacagactg agtgacagta ctcagtatat ctgagataaa ctctataatg ttttggataa 1020 

aaataacatt ccaatcacta ttgtatatat gtgcatgtat tttttaaatt aaagatgtct 1080 

agttgctttt tataagacca agaaggagaa aatccgacaa cctggaaaga tttttgtttt 1140 

cactgcttgt atgatgtttc ccattcatac acctataaat ctctaacaag aggccctttg 1200 

aactgccttg tgttctgtga gaaacaaata tttacttaga gtggaaggac tgattgagaa 1260 
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tgttccaatc caaatgaatg catcacaact tacaatgctg ctcattgttg tgagtactat 132 0 

gagattcaaa tttttctaac atatggaaag ccttttgtcc tccaaagatg agtactaggg 1380 

atcatgtgtt taaaaaaaga aaggctacga tgactgggca agaagaaaga tgggaaactg 144 0 

aataaagcag ttgatcagca tcatfcggaac atggggacga gtgacggcag gaggaccacg 1500 

aggaaatacc ctcaaaacta acttgtttac aacaaaataa agtattcact acgaaaaaaa 1560 

aaaaaaaaaa aaaaaaaaaa aaa 1583 

<210> 37 

<211> 7586 

<212> DNA 

<213> Homo Sapiens 

<400> 37 

gttctttgtg acacatcaca cagaattgga gtgctgtcct tctggagagt ggtggagaac 60 

caagatacag ttcagaacca aaggaataga gaagggcttt gatttctttt tggctttaga 12 0 

ttggggattt gggaggctta gcaggaaaga tgtccactga aaatgtggaa gggaagccca 180 

gtaaccttgg ggagagagga agagcccgga gctccacttt cctcagggtt gtccagccaa 240 

tgtttaacca cagtattttc acttctgcag tctctcctgc tgcagaacgc atccgattca 300 

tcttgggaga ggaggatgac agcccagctc cccctcagct cttcacggaa ctggatgagc 360 

tgctggccgt ggatgggcag gagatggagt ggaaggaaac agccaggtgg atcaagtttg 42 0 

aagaaaaagt ggaacagggt ggggaaagat ggagcaagcc ccatgtggcc acattgtccc 4 80 

ttcatagttt atttgagctg aggacatgta tggagaaagg atccatcatg cttgatcggg 540 

aggcttcttc tctcccacag ttggtggaga tgattgttga ccatcagatt gagacaggcc 600 

tattgaaacc tgaacttaag gataaggtga cctatacttt gctccggaag caccggcatc 660 

aaaccaagaa atccaacctt cggtccctgg ctgacattgg gaagacagtc tccagtgcaa 72 0 

gtaggatgtt taccaaccct gataatggta gcccagccat gacccatagg aatctgactt 780 

cctccagtct gaatgacatt tctgataaac cggagaagga ccagctgaag aataagttca 840 

tgaaaaaatt gccacgtgat gcagaagctt ccaacgtgct tgttggggag gttgactttt 900 

tggatactcc tttcattgcc tttgttaggc tacagcaggc tgtcatgctg ggtgccctga 960 

ctgaagttcc tgtgcccaca aggttcttgt tcattctctt aggtcctaag gggaaagcca 1020 

agtcctacca cgagattggc agagccattg ccaccctgat gtctgatgag gtgttccatg 1080 

acattgctta taaagcaaaa gacaggcacg acctgattgc tggtSl|^at gagttcctag 114 0 

atgaagtcat cgtccttcca cctggggaat gggatccagc aattaggata gagcctccta 1200 

agagtcttcc atcctctgac aaaagaaaga atatgtactc aggtggagag aatgttcaga 1260 
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tgaatgggga 


tacgccccat 


gatggaggtc acggaggagg aggacatggg gattgtgaag 


1320 


aattgcagcg 


aactggacgg 


ttctgtggtg gactaattaa agacataaag aggaaagcgc 


1360 


cattttttgc 


cagtgatttt 


tatgatgctt taaatattca agctctttcg gcaattctct 


1440 


tcatttatct 


ggcaactgta 


actaatgcta tcacttttgg aggactgctt ggggatgcca 


1500 


ctgacaacat 


gcagggcgtg 


ttggagagtt tcctgggcac tgctgtctct ggagccatct 


1560 


tttgcctttt 


tgctggtcaa 


ccactcacta ttctgagcag caccggacct gtcctagttt 


1620 


ttgagaggct 


tctatttaat 


ttcagcaagg acaataattt tgactatttg gagtttcgcc 


1680 


tttggattgg 


cctgtggtcc 


gccttcctat gtctcatttt ggtagccact gatgccagct 


1740 


tcttggttca 


atacttcaca 


cgtttcacgg aggagggctt ttcctctctg attagctfcca 


1800 


tctttatcta 


tgatgctttc 


aagaagatga tcaagcttgc agattactac cccatcaact 


1860 


ccaacttcaa 


agtgggctac 


aacactctct tttcctgtac ctgtgtgcca cctgacccag 


1920 


ctaatatctc 


aatatctaat 


gacaccacac tggccccaga gtatttgcca actatgtctt 


1980 


ctactgacat 


gtaccataat 


actacctttg actgggcatt tttgtcgaag aaggagtgtt 


2040 


caaaatacgg 


aggaaacctt 


gtcgggaaca actgtaattt tgttcctgat atcacactca 


2100 


tgtcttttat 


cctcttcttg 


ggaacctaca cctcttccat ggctctgaaa aaatfccaaaa 


2160 


ctagtcctta 


ttttccaacc 


acagcaagaa aactgatcag tgattttgcc attafccttgt 


2220 


ccattctcat 


cttttgtgta 


atagatgccc tagtaggcgt ggacacccca aaactaattg 


2280 


tgccaagtga 


gttcaagcca 


acaagtccaa accgaggttg gttcgttcca ccgtttggag 


2340 


aaaacccctg 


gtgggtgtgc 


cttgctgctg ctatcccggc tttgttggtc actatactga 


2400 


ttttcatgga 


ccaacaaatt 


acagctgtga ttgtaaacag gaaagaacat aaactcaaga 


2460 


aaggagcagg 


gtatcacttg 


gatctctttt gggtggccat cctcatggtt atatgctccc 


2520 


tcatggctct 


tccgtggtat 


gtagctgcta cggtcatctc cattgctcac atcgacagtt 


2580 


tgaagatgga 


gacagagact 


tctgcacctg gagaacaacc aaagtttcta ggagtgaggg 


2640 


aacaaagagt 


cactggaacc 


cttgtgttta ttctgactgg tctgtcagtc tttatggctc 


2700 


ccatcttgaa 


gtttataccc 


atgcctgtac tctatggtgt gttcctgtat atgggagtag 


2760 


catcccttaa 


tggtgtgcag 


ttcatggatc gtctgaagct gcttctgatg cctctgaagc 


2820 


atcagcctga 


cttcatctac 


ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt 


2880 


tcctgcaggt 


gttgtgtctg 


gccctgcttt ggatcctcaa gtcaacggtg gctgctatca 


2940 


tttttccagt 


aatgatcttg 


gcacttgtag ctgtcagaaa aggcatggac tacctcttct 


3000 


cccagcatga 


cctcagcttc 


ctggatgatg tcattccaga aaaggacaag aaaaagaagg 


3060 


aggatgagaa 


gaaaaagaaa 


aagaagaagg gaagtctgga cagtgacaat gatgattctg 


3120 
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actgcccata ctcagaaaaa gttccaagta ttaaaattcc aatggacatc atggaacagc 3ibo 

aacctttcct aagcgatagc aaaccttctg acagagaaag atcaccaaca ttccttgaac 3240 

gccacacatc atgctgataa aattcctttc cttcagtcac tcggtatgcc aagtcctcct 3300 

agaactccag taaaagttgc ctcaaattag actagaactt gaacctgaag acaatgatta 3360 

tttctggagg agcaagggaa cagaaactac attgtaacct gtttgtcttt cttaaaactg 3420 

acatttgttg ttaatgtcat ttgtttttgt ttggctgttt gtttattttt taacttttat 3480 

ttcgtctcag tttttggtca caggccaaat aatacagcgc tctctctgct tctctcttgc 3540 

atagatacaa tcaagacaat agtgcaccgt tccttaaaaa cagcatctga ggaatccccc 3 600 

ttttgttctt aaactttcag atgtgtcctt tgataaccaa attctgtcac tcaagacaca 3660 

gacacccaca gaccctgtcc tttgcctcta ttaagcagag gatggaagta ttaaggattt 3720 

tgtaacacct tttatgaaaa tgttgaagga acttaaaact ttagctttgg agctgtgctt 3780 

actggcttgt ctttgtctgg tagaacaaac cttgacctcc agacagagtc ccttctcact 3840 

tatagagctc tccaggactg gaaaaagtgc tgctatttta acttgctctt gcttgtaaat 3 900 

cctaatctta gagttatcaa aagaagaaaa aactgaaggt actttactcc ctatagagaa 3 960 

accattgcca tcattgtagc aagtgctgga atgtcccttt tttcctatgc aactttttta 4020 

taacccttta atgaacttat ctgtggagta cattgaagaa tatttttctt cctagatttt 4080 

gttgtttaaa ttatggggcc taacctgcca cttatttttt gtcaattttt aaaacttttt 4140 

tttaattact gtaaagaaaa tgaatttttt cctgcagcag gaaacatagt tttcagtagt 4200 

tctacctctt atttgtagct gccaggcttt ctgtaaaaat tgtattgtat ataatgtgat 4260 

ttttacacat acatacacac acaaatacac aatctctagg gtaagccaga aggcaagatc 4320 

agattaaaaa caccatgttt ctaagcatcc atttttccct ttctttaaaa gaaacttaac 4380 

tgttctatga aggagattga gggagaagag acaaactcct atgtcatgag aataaccgat 4440 

gttctgataa tagtagcatc taggtacaga tgctggttgt attaccacgt caatgtccta 4500 

tgcagtattg ttagacattt tctcattttg aaatatttgt gtgtttgtgt atgtgctctg 4560 

tgccatggct ggtgtatata tgtgcaatgt tagaaggcaa aagagtgatg gtaggcagag 4620 

ggcaaagtca ttgaatctct tatgccagtt ttcataaaac ccaaaccaca tatgaaaaaa 4680 

tccattaagg gtccaagaag tctgtccata tgaaaatgag ggtaaatata gtttatttcc 4740 

caggtatcag tcattataat tgatataata gctctaacat gcaatataaa attcatagga 4800 

gtattaatag cccatttaca catctataaa atgtaatggg attgcagagc tgcagagtac 4860 

agtgtaacag tactctcatg caattttttt caggatgcaa aggcaattat tctttgtaag 4920 



WO 2004/073657 PCT/US2004/005455 

68/282 

cgggacattt agatatattt gtgtacatat tatatgtatg tatatttcaa agtaccacac 4980 

tgaaaattag acatttatta accaaattta acgtggtatt taaaggtaat atttttaata 5040 

tgatacatta catattgtga atgtatacta aaaaaacatt ttaaatgtta aaattataat 5100 

ttcagattca tataaccaca actgtgatat atcctaacta taaccagttg ttgaggggta 5160 

tactagaagc agaatgaaac cacatttttt ggtttgataa tatgcactta ttgactccca 5220 

ctcattgtta tgttaattaa gttattattc tgtctccttg taattttgat tacaaaaatt 5280 

ttattatcct gagttagctg ttacttttac agtacctgat actcctaaaa cttttaactt 5340 

atacaaatta gtcaataatg accccaattt tttcattaaa ataatagtgg tgaattatat 5400 

gttattgtgt taaaacctca cttgccaaat tctggcttca catttgtatt tagggctatc 5460 

cttaaaatga tgagfcctata ttatctagct ttctattacc ctaatataaa ctggtataag 5520 

aagactttcc ttttttcttt atgcatggaa gcatcaataa attgtttaaa aaccatgtat 5580 

agtaaattca gcttaacccg tgatcttctt aagttaaagg tacttttgtt ttataaaagc 5640 

tctagataaa actttctttt ctgatcatga atcaagtatc tgtggtttca tgcccctctc 5700 

tatacctttc aaagaactcc tgaagcaact taactcatca tttcagcctc tgagtagagg 5760 

taaaacctat gtgtacttct gtttatgatc catattgata tttatgacafc gaacacagaa 5820 

tagtacctta catttgctaa acagacagtt aatatcaaat cctttcaata ttctgggaac 5880 

ccagggaagt ttttaaaaat gtcattactt tcaaaggaac agaagfcagtt aaccaaacta 5940 

acaagcaaaa cctgaggttt acctagtgac accaaattat cggtatttta actgaattta 6000 

cccattgact aagaatgaac cggatttggt ggtggttttg tttctatgca aactggacac 6060 

aaattacaac agtaaatttt tttataagtg cttctccctt ctccatgatg tgacttccgg 6120 

agataaagga ttcaaaagat aaagacaaag tacgctcaga gttgttaacc agaaagtcct 6180 

ggctgtggtt gcagaaacac tgttggaaga aaagagatga ctaagtcaag tgtctgcctt 6240 

atcaaaagag caaaaatgcc tctggttttg tgtttgggag aaaaatatct tggacgcact 6300 

gttttccttg ataaaagtca tcttctctac tgtgtgaaat gaatacttgg aattctaatt 6360 

gttttgtgtg ccaggggcag taatgtccct gcctcttctc ccaatcaagg ttgaggagtg 6420 

gggctgggga gaggacttaa ctgacttaag aagtaggaaa acaaaaacct ctctcctcag 6480 

ccttccacct ccaagagagg aggaaaaaca gttgtctgct gtctgtaatt cagtttgcgt 6540 

gtattttatg ctcatgcacc aacccataca gagtaaatct tttatcaact atatactggt 6600 

gtttaataga gaatgattgt cttccgagtt ttttggttcc ttttttaact gtgttaaagt 6660 

acttgaaatg tattgactgc tgactatatt ttaaaaacaa aatgaaataa tttgagttgt 6720 

attacagagg ttgacatfcgt tcagggatgg gacaaagcct tcttcaatcc ttttcatact 6780 
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acttaatgat 


tttggtgcag 


gaacctgaga ttttctgatt tatatttcat gatatttcac 


6840 


atttgctctt 


cacagcatga 


gcatgaagcc cagtggcacc aaatggctgg gtacaatcaa 


6900 


gtgatatttt 


gtagcacctc 


actatctgaa aggccatgag ttttcagatg atttcattga 


6960 


gcttcattgc 


agcctgaaat 


tttaaaaaag ttgtgtaata cgccaaccag tcaagttgtg 


7020 


ttttggccag 


agatttagat 


atgtccaatt tcctggctca tttcattgtg ctctatgggt 


7080 


acgtataaaa 


agcaagaatt 


ctgtttccta ggcaaacatt gcaactcagg gctaaagtca 


7140 


tccagtgaaa 


cttttagagc 


cagaagtaac tttgtcccag tcctacaatg tgaaaagagt 


7200 


gaatagttgc 


ctctttttag 


ccattttcat ggctggtaca tattcgtacg cattactttt 


7260 


cagaatcaat 


acgcactttc 


agatattctt atttttattc tcttaagtct ttattaactt 


7320 


tcrcracraa'aaa 


aataatacat 


(-L/iutLdciu (.actac^aa^i. agauCodCdt ggtggaacaa 


73 80 


aatgataaag 


aacagaaaac 


atttcaatat attactaata actttttcca atataaatcc 


7440 


taaaattcct 


ataacatagt 


attttacagt tttatgaagc tttctattgt gacttttatg 


7500 


gaattaagag 


atgaagaaga 


tgagatattt tagcatttat atttttcaaa attatatgta 


7560 


tacttaaaaa 


taaagtaact 


ttatgc 


7586 



<210> 38 

<211> 1958 

<212> DNA 

<213> Homo Sapiens 

<400> 38 



cggcacagcc 


tcacacctga 


acgctgtcct cccgcagacg agaccggcgg gcactgcaaa 


60 


gctgggactc 


gtctttgaag 


gaaaaaaaat agcgagtaag aaatccagca ccattcttca 


120 


ctgacccatc 


ccgctgcacc 


tcttgtttcc caagtttttg aaagctggca actctgacct 


180 


cggtgtccaa 


aaatcgacag 


ccactgagac cggctttgag aagccgaaga tttggcagtt 


240 


tccagactga 


gcaggacaag 


gtgaaagcag gttggaggcg ggtccaggac atctgagggc 


300 


tgaccctggg 


ggctcgtgag 


gctgccaccg ctgctgccgc tacagaccca gccttgcact 


360 


ccaaggctgc 


gcaccgccag 


ccactatcat gtccactccc ggggtcaatt cgtccgcctc 


420 


cttgagcccc 


gaccggctga 


acagcccagt gaccatcccg gcggtgatgt tcatcttcgg 


480 


ggtggtgggc 


aacctggtgg 


ccatcgtggt gctgtgcaag tcgcgcaagg agcagaagga 


540 


gacgaccttc 


tacacgctgg 


tatgtgggct ggctgtcacc gacctgttgg gcactttgtt 


600 


ggtgagcccg 


gtgaccatcg 


ccacgtacat gaagggccaa tggcccgggg gccagccgct 


660 


gtgcgagtac 


agcaccttca 


ttctgctctfc cttcagcctg tccggcctca gcatcatctg 


720 


cgccatgagt 


gtcgagcgct 


acctggccat caaccatgcc tatttctaca gccactacgt 


780 
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ggacaagcga 


ttggcgggcc 


tcacgctctt tgcagtctat gcgtccaacg tgctcttttg 


840 


cgcgctgccc 


aacatgggtc 


tcggtagctc gcggctgcag tacccagaca cctggtgctt 


900 


catcgactgg 


accaccaacg 


tgacggcgca cgccgcctac tcctacatgt acgcgggctt 


960 


cagctccttc 


ctcattctcg 


ccaccgtcct ctgcaacgtg cttgtgtgcg gcgcgctgct 


1020 


ccgcatgcac 


cgccagttca 


tgcgccgcac ctcgctgggc accgagcagc accacgcggc 


1080 


cgcggccgcc 


tcggttgcct 


cccggggcca ccccgctgcc tccccagcct tgccgcgcct 


1140 


cagcgacttt 


cggcgccgcc 


ggagcttccg ccgcatcgcg ggcgccgaga tccagatggt 


1200 


catcttactc 


attgccacct 


ccctggtggt gctcatctgc tccatcccgc tcgtggtgcg 


1260 


agtattcgtc 


aaccagttat 


atcagccaag tttggagcga gaagtcagta aaaatccaga 


1320 


tttgcaggcc 


atccgaattg 


cttctgtgaa ccccatccta gacccctgga tatatatcct 


1380 


cctgagaaag 


acagtgctca 


gtaaagcaat agagaagatc aaatgcctct tctgccgcat 


1440 


tggcgggtcc 


cgcagggagc 


gctccggaca gcactgctca gacagtcaaa ggacatcttc 


1500 


tgccatgtca 


ggccactctc 


gctccttcat ctcccgggag ctgaaggaga tcagcagtac 


• 1560 


atctcagacc 


ctcctgccag 


acctctcact gccagacctc agtgaaaatg gccttggagg 


1620 


caggaatttg 


cttccaggtg 


tgcctggcat gggcctggcc caggaagaca ccacctcact 


1680 


gaggacxtxg 


cgaatatcag 


agacctcaga ctcttcacag ggtcaggact cagagagtgt 


1740 


cttactggtg 


gatgaggctg 


gtgggagcgg cagggctggg cctgccccta aggggagctc 


1800 


cctgcaagtc 


acatttccca 


gtgaaacact gaacttatca gaaaaatgta tataataggc 


1860 


aaggaaagaa 


atacagtact 


gtttctggac ccttataaaa tcctgtgcaa tagacacata 


1920 


catgtcacat 


ttagctgtgc 


tcagaagggc tatcatca 


1958 



<210> 39 

<211> 1740 

<212> DNA 

<213> Homo Sapiens 

<400> 39 

cagtatccct cctgacaaaa ctaacaaaaa tcctgttagc caaataatca gccacattca 60 

tatttaccgt caaagttttt atcctcattt tacagcagtg gagagcgatt gccccgggtc 120 

ccacgttagg aagagagaga actgggattt gcacccaggc aatctgggga cagagctgtg 180 

atcacaactc catgagtcag ggccgagcca gccccttcac caccagccgg- ccgcgccccg 240 

ggaaggaagt ttgtggcgga ggaggttcgt acgggaggag ggggaggcgc ccacgcatct 300 

ggggctgact cgctctttcg caaaacgtct gggaggagtc cctggggcca caaaactgcc 360 

tccttcctga ggccagaagg agagaagacg tgcagggacc ccgcgcacag gagctgccct 420 
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cgcgacatgg 


gtcacccgcc 


gctgctgccg ctgctgctgc tgctccacac ctgcgtccca 


480 


gcctcttggg 


gcctgcggtg 


catgcagtgt aagaccaacg gggattgccg tgtggaagag 


540 


tgcgccctgg 


gacaggacct 


ctgcaggacc acgatcgtgc gcttgtggga agaaggagaa 


600 


gagctggagc 


tggtggagaa 


aagctgtacc cactcagaga agaccaacag gaccctgagc 


660 


tatcggactg 


gcttgaagat 


caccagcctt accgaggttg tgtgtgggtt agacttgtgc 


720 


aaccagggca 


actctggccg 


ggctgtcacc tattcccgaa gccgttacct cgaatgcatt 


780 


tcctgtggct 


catcagacat 


gagctgtgag aggggccggc accagagcct gcagtgccgc 


840 


agccctgaag 


aacagtgcct 


ggatgtggtg acccactgga tccaggaagg tgaagaaggg 


900 


cgtccaaagg 


atgaccgcca 


cctccgtggc tgtggctacc ttcccggctg cccgggctcc 


960 


aatggtttcc 


acaacaacga 


caccttccac ttcctgaaat gctgcaacac caccaaatgc 


1020 


aacgagggcc 


caatcctgga 


gcttgaaaat ctgccgcaga atggccgcca gtgttacagc 


1080 


tgcaagggga 


acagcaccca 


tggatgctcc tctgaagaga ctttcctcat tgactgccga 


1140 


ggccccatga 


atcaatgtct 


ggtagccacc ggcactcacg aaccgaaaaa ccaaagctat 


1200 


atggtaagag 


gctgtgcaac 


cgcctcaatg tgccaacatg cccacctggg tgacgccttc 


1260 


agcatgaacc 


acattgatgt 


ctcctgctgt actaaaagtg gctgtaacca cccagacctg 


1320 


gatgtccagt 


accgcagtgg 


ggctgctcct cagcctggcc ctgcccatct cagcctcacc 


1380 


atcaccctgc 


taatgactgc 


cagactgtgg ggaggcactc tcctctggac ctaaacctga ■ 


1440 


aatccccctc 


hctcrccctaa 


yy ai '^^yy yyyat*L.t*uuL tyLLLLULuc ccggctccca 


1500 


gccctacaga 


cttgctgtgt 


gacctcaggc cagtgtgccg acctctctgg gcctcagttt 


1560 


tcccagctat 


gaaaacagct 


atctcacaaa gttgtgtgaa gcagaagaga aaagctggag 


1620 


gaaggccgtg 


ggcaatggga 


gagctcttgt tattattaat attgttgccg ctgttgtgtt 


1680 


gttgttatta 


attaatattc 


atattattta ttttatactt acataaagat tttgtaccag 


1740 



<210> 40 

<211> 3088 

<212> DNA 

<213> Homo Sapiens 

<400> 40 

ttgcttgagt catcttctga agctttaaaa acaattgatg aattggcctt caagatagac 60 

ctaaatagca catcacatgt gaatattaca actcggaact tggctctcag cgtatcatcc 120 

ctgttaccag ggacaaatgc aatttcaaat tttagcattg gtcttccaag caataatgaa 180 

tcgtatttcc agatggattt tgagagtgga caagtggatc cactggcatc tgtaattttg 240 

cctccaaact tacttgagaa tttaagtcca gaagattctg tattagttag aagagcacag 300 
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tttactttct tcaacaaaac tggacttttc caggatgtag gaccccaaag aaaaacttta 360 

gtgagttatg tgatggcgtg cagtattgga aacattacta tccagaatct gaaggatcct 420 

gttcaaataa aaatcaaaca tacaagaact caggaagtgc atcatcccat ctgtgccttc 480 

tgggatctga acaaaaacaa aagttttgga ggatggaaca cgtcaggatg tgttgcacac 54 o 

agagattcag atgcaagtga gacagtctgc ctgtgtaacc acttcacaca ctttggagtt 600 

ctgatggacc ttccaagaag tgcctcacag ttagatgcaa gaaacactaa agtcctcact 660 

ttcatcagct atattgggtg tggaatatct gctatttttt cagcagcaac tctcctgaca 720 

tatgttgctt ttgagaaatt gcgaagggat tatccctcca aaatcttgat gaacctgagc 780 

acagccctgc tgttcctgaa tctcctcttc ctcctagatg gctggatcac ctccttcaat 840 

gtggatggac tttgcattgc tgttgcagtc ctgttgcatt tcttccttct ggcaaccttt 900 

acctggatgg ggctagaagc aattcacatg tacattgctc tagttaaagt atttaacact 960 

tacattcgcc gatacattct aaaattctgc atcattggct ggggtttgcc tgccttagtg 1020 

gtgtcagttg ttctagcgag cagaaacaac aatgaagtct atggaaaaga aagttatggg 1080 

aaagaaaaag gtgatgaatt ctgttggatt caagatccag tcatatttta tgtgacctgt 1140 

gctgggtatt ttggagtcat gttttttctg aacattgcca tgttcattgt ggtaatggtg 1200 

cagatctgtg ggaggaatgg caagagaagc aaccggaccc tgagagaaga agtgttaagg 1260 

aacctgcgca gtgtggttag cttgaccttt ctgttgggca tgacatgggg ttttgcattc 1320 

tttgcctggg gacccttaaa tatccccttc atgtacctct tctccatctt caattcatta 1380 

caaggcttat ttatattcat cttccactgt gctatgaagg agaatgttca gaaacagtgg 1440 

cggcggcatc tctgctgtgg tagatttcgg ttagcagata actcagattg gagtaagaca 1500 

gctaccaata tcatcaagaa aagttctgat aatctaggaa aatctttgtc ttcaagctcc 1560 

attggttcca actcaaccta tcttacatcc aaatctaaat ccagctctac cacctatttc 1620 

aaaaggaata gccacacaga taatgtctcc tatgagcatt ccttcaacaa aagtggatca 1680 

ctcagacagt gcttccatgg acaagtcctt gtcaaaactg gcccatgctg atggagatca 1740 

aacatcaatc atccctgtcc atcaggtcat tgataaggtc aagggttatt gcaatgctca 1800 

ttcagacaac ttctataaaa atattatcat gtcagacacc ttcagccaca gcacaaagtt 1860 

ttaatgtctt taagaaaaag aaatcaatct gcagaaatgt gaagatttgc aagcagtgta 1920 

aactgcaact agtgatgtaa atgtgctatt acctaggtaa ctgcatatat ataaggaatg 1980 

tattttgtta agaaggcttt tgtgaaattc agaatttttc tttttaatat atttcttcca 2040 

tggaagagtt gtcatcacta aaacttcagt actgagagta acatgactca gtagccacag 2100 
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aagctatgat ttgtaaaata tataattgaa tcagagtaat cataatgcag gggagacatt 2160 

caaattagag acaagggaga agcaatgctg aggaagaccc tagatagagc tcattttact 2220 

ccacctaatc gttatatctg gatataccca ttttctgcat cttctttctc aacaafcaaac 2280 

tgtccttgct ttggagactt taagacafctt cctaaagcac aaataaaagc ctcgtatttc 2340 

cccattgaga gttttgttcc aaggaatatg aagtgagaca tatgggtgag tcataataat 2400 

caaaataatt tatgaagagc tgggtctgca atagctagtc taaaaactac ttgtgtgtca 24 60 

gtcctctggt tatagtatat aagagcctga ggaggtctgg caagatagat ggtgtattat 2520 

ttatggatca ggctgctgca tacaaacctt gcatactatt atgcagctta cctaactctc 2580 

agactattct gagtaatgct tgcttgctaa tgaatgtata ggagaccaca ttgtaattgt 2640 

tcttagatga tggagtccat gcagtttctt agaaatcggt ctcagtgcat gctgtgcttt 2700 

ttcacatttg ctctgggtta tctgggaagt atcaggttct gggaggcaac agcattaagt 2760 

gataagaaaa ggagacattc tggcaaagcc aatctgctta aaggcaaagt ccagaacctg 2 820 

gaacctagag gcctttctct ctgcacgaaa aacaggtagt ttgcagtctg agatatggga 288 0 

gagcttttag gctacacagc aacccaaggg acctctcacc ttttgctgag cttcaatcag 294 0 

gaagctattt gcctggctcc agcagatgat gagataatga ggtagtgggt tttttattac 3 000 

tgttccattt tgcaacatcc tgcaacacca tcctgggaga caagagcatt acccagcttg 3 060 

gctttcacgg gggagggttg tattcagt 3 o dQ 

<210> 41 

<211> 3868 

<212> DNA 

<213> Homo Sapiens 

<400> 41 

atgaacctct gaaaactgcc ggcatctgag gtttcctcca aggccctctg aagtgcagcc 60 

cataatgaag gtcttggcgg caggagttgt gcccctgctg ttggttctgc actggaaaca 120 

tggggcgggg agccccctcc ccatcacccc tgtcaacgcc acctgtgcca tacgccaccc 180 

atgtcacaac aacctcatga accagatcag gagccaactg gcacagctca atggcagtgc 240 

caatgccctc tttattctct attacacagc ccagggggag ccgttcccca acaacctgga 300 

caagctatgt ggccccaacg tgacggactt cccgcccttc cacgccaacg gcacggagaa 360 

ggccaagctg gtggagctgt accgcatagt cgtgtacctt ggcacctccc tgggcaacat 420 

cacccgggac cagaagatcc tcaaccccag tgccctcagc ctccacagca agctcaacgc 480 

caccgccgac atcctgcgag gcctccttag caacgtgctg tgccgcctgt gcagcaagta 540 

ccacgtgggc catgtggacg tgacctacgg ccctgacacc tcgggtaagg atgtcttcca 600 
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gaagaagaag ctgggctgtc aactcctggg gaagtataag cagatcatcg ccgtgttggc 660 

ccaggccttc tagcaggagg tcttgaagtg tgctgtgaac cgagggatct caggagttgg 72 o 

gtccagatgt gggggcctgt ccaagggtgg ctggggccca gggcatcgct aaacccaaat 780 

gggggctgct ggcagacccc gagggtgcct ggccagtcca ctccactctg ggctgggctg 840 

tgatgaagct gagcagagtg gaaacttcca tagggaggga gctagaagaa ggtgcccctt 90 0 

cctctgggag attgtggact ggggagcgtg ggctggactt ctgcctctac ttgtcccttt 960 

ggccccttgc tcactttgtg cagtgaacaa actacacaag tcatctacaa gagccctgac 1020 

cacagggtga gacagcaggg cccaggggag tggaccagcc cccagcaaat tatcaccatc 1080 

tgtgcctttg ctgcccctta ggttgggact taggtgggcc agaggggcta ggatcccaaa 1140 

ggactccttg tcccctagaa gtttgatgag tggaagatag agaggggcct ctgggatgga 1200 

aggctgtctt cttttgagga tgatcagaga acttgggcat aggaacaatc tggcagaagt 1260 

ttccagaagg aggtcacttg gcattcaggc tcttggggag gcagagaagc caccttcagg 1320 

cctgggaagg aagacactgg gaggaggaga ggcctggaaa gctttggtag gttcttcgtt 1380 

ctcttccccg tgatcttccc tgcagcctgg gatggccagg gtctgatggc tggacctgca 1440 

gcaggggttt gtggaggtgg gtagggcagg ggcaggttgc taagtcaggt gcagaggttc 1500 

tgagggaccc aggctcttcc tctgggtaaa ggfcctgtaag aaggggctgg ggtagctcag 1560 

agtagcagct cacatctgag gccctgggag gtcttgtgag gtcacacaga ggtacttgag 1620 

ggggactgga ggccgtctct ggtccccagg gcaagggaac agcagaactt agggtcaggg 1680 

tctcagggaa ccctgagctc caagcgtgct gtgcgtctga cctggcatga tttctattta 1740 

ttatgatatc ctatttatat taacttattg gtgctttcag tggccaagtt aattcccctt 1800 

tccctggtcc ctactcaaca aaatatgatg atggctcccg acacaagcgc cagggccagg i860 

gcttagcagg gcctggtctg gaagtcgaca atgttacaag tggaataagc ttacgggtga 1920 

agctcagaga agggtcggat ctgagagaat ggggaggcct gagtgggagt ggggggcctt 1980 

gctccacccc catcccctac tgtgacttgc tttagcgtgt cagggtccag gctgcagggg 2040 

ctgggccaat ttgtggagag gccgggtgcc tttctgtctt gcttccaggg ggctggttca 2100 

cactgttctt gggcgcccca gcattgtgtt gtgaggcgca ctgttcctgg cagatattgt 2160 

gccccctgga gcagtgggca agacagtcct tgtggcccac cctgtccttg tttctgtgtc 2220 

cccatgctgc ctctgaaata gcgccctgga acaaccctgc ccctgcaccc agcatgctcc 2280 ' 

gacacagcag ggaagctcct cctgtggccc ggacacccat agacggtgcg gggggcctgg 23 4 0 

ctgggccaga ccccaggaag gtggggtaga ctggggggat cagctgccca ttgctcccaa 2400 

gaggaggaga gggaggctgc agacgcctgg gactcagacc aggaagctgt gggccctcct 2460 
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gctccacccc 


catcccactc 


ccacccatgt ctgggctccc aggcagggaa cccgatctct 


2520 


tcctttgtgc 


tggggccagg 


cgagtggaga aacgccctcc agtctgagag caggggaggg 


2580 


aaggaggcag 


cagagttggg 


gcagctgctc agagcagtgt tctggcttct tctcaaaccc 


2640 


tgagcgggct 


gccggcctcc 


aagttcctcc gacaagatga tggtactaat tatggtactt 


2700 


ttcactcact 


ttgcaccttt 


ccctgtcgct ctctaagcac tttacctgga tggcgcgtgg 


2760 


gcagtgtgca 


ggcaggtcct 


gaggcctggg gttggggtgg agggtgcggc ccgqaqttcrt 


2820 


ccatctgtcc 


atcccaacag 


caagacgagg atgtggctgt tgagatgtgg gccacactca 


2880 


cccttgtcca 


ggatgcaggg 


actgccttct ccttcctgct tcatccggct tagcttgggg 


2940 


ctggctgcat 


tcccccagga 


tgggcttcga gaaagacaaa cttgtctgga aaccagagtt 


3000 


gctgattcca 


cccggggggc 


ccggctgact cgcccatcac ctcatctccc tgtggacttq 


3060 


ggagctctgt 


gccaggccca 


ccttgcggcc ctggctctga gtcgctctcc cacccagcct 


3120 


ggacttggcc 


ccatgggacc 


catcctcagt gctccctcca gatcccgtcc ggcagcttgg 


3180 


cgtccaccct 


gcacagcatc 


actgaatcac agagcctttg cgtgaaacag ctctgccagg 


3240 


ccgggagctg 


ggtttctctt 


ccctttttat ctgctggtgt ggaccacacc tgggcctggc 


3300 


cggaggaaga 


gagagtttac 


caagagagat gtctccgggc ccttatttat tatttaaaca 


3360 


tttttttaaa 


aagcactgct 


agtttacttg tctctcctcc ccatcgtccc catcgtcctc 


3420 


cttgtccctg 


acttggggca 


cttccaccct gacccagcca gtccagctct gccttgccgg 


3480 


ctctccagag 


tagacatagt 


9tgtggggtt ggagctctqq cacccqqqqa aataacattt 


3540 


ccctgcagat 


ggtacagatg 


ttcctgcctt agagtcatct ctagttcccc acctcaatcc 


3600 


cggcatccag 


ccttcagtcc 


cgcccacgtg ctagctccgt gggcccaccg tgcggcctta 


3660 


gaggtttccc 


tccttccttt 


ccactgaaaa gcacatggcc ttgggtgaca aattcctctt 


3720 


tgatgaatgt 


accctgtggg 


gatgtttcat actgacagat tatttttatt tattcaatgt 


3780 


catatttaaa 


atatttattt 


tttataccaa atgaatcact ttttttttta agaaaaaaaa 


3840 


gagaaatgaa 


taaagaatct 


actcttcg 


3868 



<210> 42 

<211> 3145 

<212> DNA 

<213> Homo Sapiens 

<400> 42 

ccccaggtcc ggacaggccg agatgacgcc gagccccctg ttgctgctcc tgctgccgcc 60 

gctgctgctg ggggccttcc caccggccgc cgccgcccga ggccccccaa agatggcgga 120 

caaggtggtc ccacggcagg tggcccggct gggccgcact gtgcggctgc agtgcccagt 180 
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99agggggac ccgccgccgc tgaccatgtg gaccaaggat ggccgcacca tccacagcgg 240 

ctggagccgc ttccgcgtgc tgccgcaggg gctgaaggtg aagcaggtgg agcgggagga 300 

tgccggcgtg tacgtgtgca aggccaccaa cggcttcggc agccttagcg tcaactacac 360 

cctcgtcgtg ctggatgaca ttagcccagg gaaggagagc ctggggcccg acagctcctc 420 

tgggggtcaa gaggaccccg ccagccagca gtgggcacga ccgcgcttca cacagccctc 480 

caagatgagg cgccgggtga tcgcacggcc cgtgggtagc tccgtgcggc tcaagtgcgt 540 

ggccagcggg caccctcggc ccgacatcac gtggatgaag gacgaccagg ccttgacgcg 600 

cccagaggcc gctgagccca ggaagaagaa gtggacactg agcctgaaga acctgcggcc 660 

ggaggacagc ggcaaataca cctgccgcgt gtcgaaccgc gcgggcgcca tcaacgccac 720 

ctacaaggtg gatgtgatcc agcggacccg ttccaagccc gtgctcacag gcacgcaccc 780 

cgtgaacacg acggtggact tcggggggac cacgtccttc cagtgcaagg tgcgcagcga 840 

cgtgaagccg gtgatccagt ggctgaagcg cgtggagtac ggcgccgagg gccgccacaa 900 

ctccaccatc gatgtgggcg gccagaagtt tgtggtgctg cccacgggtg acgtgtggtc 960 

gcggcccgac ggctcctacc tcaataagct gctcatcacc cgtgcccgcc aggacgatgc 102 0 

gggcatgtac atctgccttg gcgccaacac catgggctac agcttccgca gcgccttcct 1080 

caccgtgctg ccagacccaa aaccgcaagg gccacctgtg gcctcctcgt cctcggccac 1140 

tagcctgccg tggcccgtgg tcatcggcat cccagccggc gctgtcttca tcctgggcac 1200 

cctgctcctg tggctttgcc aggcccagaa gaagccgtgc acccccgcgc ctgcccctcc 1260 

cctgcctggg caccgcccgc cggggacggc ccgcgaccgc agcggagaca aggaccttcc 1320 

ctcgttggcc gccctcagcg ctggccctgg tgtggggctg tgtgaggagc atgggtctcc 1380 

ggcagccccc cagcacttac tgggcccagg cccagttgct ggccctaagt tgtaccccaa 1440 

actctacaca gacatccaca cacacacaca cacacactct cacacacact cacacgtgga 1500 

gggcaaggtc caccagcaca tccactatca gtgctagacg gcaccgtatc tgcagtgggc 1560 

acgggggggc cggccagaca ggcagactgg gaggatggag gacggagctg cagacgaagg 1620 

caggggaccc atggcgagga ggaatggcca gcaccccagg cagtctgtgt gtgaggcata 1680 

gcccctggac acacacacac agacacacac actacctgga tgcatgtatg cacacacatg 1740 

cgcgcacacg tgctccctga aggcacacgt acgcacacac gcacatgcac agatatgccg 1800 

cctgggcaca cagataagct gcccaaatgc acgcacacgc acagagacat gccagaacat I860 

acaaggacat gctgcctgaa catacacacg cacacccatg cgcagatgtg ctgcctggac 1920 

acacacacac acacggatat gctgtctgga cgcacacacg tgcagatatg gtatccggac 1980 
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acacacgtgc acagatatgc tgcctggaca cacagataat gctgccttga cacacacatg 2040 

cacggatatt gcctggacac acacacacac acgcgtgcac agatatgctg tctggacagg 2100 

cacacacatg cagatatgct gcctggacac acacttccag acacacgtgc acaggcgcag 2160 

atatgctgcc tggacacacg cagatatgct gtctagtcac acacacacgc agacatgctg 2220 

tccggacaca cacacgcatg cacagatatg ctgtccggac acacacacgc acgcagatat 2280 

gctgcctgga cacacacaca gataatgctg cctcaacact cacacacgtg cagatattgc 2340 

ctggacacac acatgtgcac agatatgctg tctggacatg cacacacgtg cagatatgct 2400 

gtccggatac acacgcacgc acacatgcag atatgctgcc tgggcacaca cttccggaca 2460 

cacatgcaca cacaggtgca gatatgctgc ctggacacac gcagactgac gtgcttttgg 2520 

g a gggtgtgc cgtgaagcct gcagtacgtg tgccgtgagg ctcatagttg atgagggact 2580 

ttccctgctc caccgtcact cccccaactc tgcccgcctc tgtccccgcc tcagtccccg 2640 

cctccatccc cgcctctgtc ccctggcctt ggcggctatt tttgccacct gccttgggtg 2700 . 

cccaggagtc ccctactgct gtgggctggg gttgggggca cagcagcccc aagcctgaga 2760 

ggctggagcc catggctagt ggctcatccc cactgcattc tccccctgac acagagaagg 2820 

ggccttggta tttatattta agaaatgaag ataatattaa taatgatgga aggaagactg 2880 

ggttgcaggg actgtggtct ctcctggggc ccgggacccg cctggtcttt cagccatgct 294 0 

gatgaccaca ccccgtccag gccagacacc accccccacc ccactgtcgt ggtggcccca 3000 

gatctctgta attttatgta gagtttgagc tgaagccccg tatatttaat ttattttgtt 3060 

aaacatgaaa gtgcatcctt tccctccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3120 

aaaaaaaaaa aaaaaaaaaa aaaaa 3145 

<210> 43 

<211> 3273 

<212> DNA 

<213> Homo Sapiens 

<400> 43 

tgaaaggcgg ttgtggtgca aaggaaaacc cacaggccaa ggaatgggaa gaccaaggtt 60 

gacacttgtt tgtcacgtgt caataatcat ctctgcccgg gacctcagca tgaacaacct 120 

cacagagctt cagcctggcc tcttccacca cctgcgcttc ttggaggagc tgcgtctctc 180 

tgggaaccat ctctcacaca tcccaggaca agcattctct ggtctctaca gcctgaaaat 240 

cctgatgctg cagaacaatc agctgggagg aatccccgca gaggcgctgt gggagctgcc 300 

gagcctgcag tcgctgcgcc tagatgccaa cctcatctcc ctggtcccgg agaggagctt 360 

tgaggggctg tcctccctcc gccacctctg gctggacgac aatgcactca cggagatccc 420 
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tgtcagggcc ctcaacaacc tccctgccct gcaggccatg accctggccc tcaaccgcat 480 

cagccacatc cccgactacg cgttccagaa tctcaccagc cttgtggtgc tgcatttgca 540 

taacaaccgc atccagcatc tggggaccca cagcttcgag gggctgcaca atctggagac 600 

actagacctg aattataaca agctgcagga gttccctgtg gccatccgga ccctgggcag 660 

actgcaggaa ctggggttcc ataacaacaa catcaaggcc atcccagaaa aggccttcat 720 

g9ggaaccct ctgctacaga cgatacactt ttatgataac ccaatccagt ttgtgggaag 780 

i 

atcggcattc cagtacctgc ctaaactcca cacactatct ctgaatggtg ccatggacat 84 0 

ccaggagttt ccagatctca aaggcaccac cagcctggag atcctgaccc tgacccgcgc 900 

aggcatccgg ctgctcccat cggggatgtg ccaacagctg cccaggctcc gagtcctgga 960 

actgtctcac aatcaaattg aggagctgcc cagcctgcac aggtgtcaga aattggagga 1020 

aatcggcctc caacacaacc gcatctggga aattggagct gacaccttca gccagctgag 1080 

ctccctgcaa gccctggatc ttagctggaa cgccatccgg tccatccacc ccgaggcctt 1140 

ctccaccctg cactccctgg tcaagctgga cctgacagac aaccagctga ccacactgcc 120 0 

cctggctgga cttgggggct tgatgcatct gaagctcaaa gggaaccttg ctctctccca 1260 

ggccttctcc aaggacagtt tcccaaaact gaggatcctg gaggtgcctt atgcctacca 132 0 

gtgctgtccc tatgggatgt gtgccagctt cttcaaggcc tctgggcagt gggaggctga 1380 

agaccttcac cttgatgatg aggagtcttc aaaaaggccc ctgggcctcc ttgccagaca 1440 

agcagagaac cactatgacc aggacctgga tgagctccag ctggagatgg aggactcaaa 1500 

gccacacccc agtgtccagt gtagccctac tccaggcccc ttcaagccct gtgagtacct 1560 

ctttgaaagc tggggcatcc gcctggccgt gtgggccatc gtgttgctct ccgtgctctg 1620 

caatggactg gtgctgctga ccgtgttcgc tggcgggcct gtccccctgc ccccggtcaa 1680 

gtttgtggta ggtgcgattg caggcgccaa caccttgact ggcatttcct gtggccttct 1740 

agcctcagtc gatgccctga cctttggtca gttctctgag tacggagccc gctgggagac 1800 

ggggctaggc tgccgggcca ctggcttcct ggcagtactt gggtcggagg catcggtgct i860 

gctgctdact ctggccgcag tgcagtgcag cgtctccgtc tcctgtgtcc gggcctatgg 1920 

gaagtccccc tccctgggca gcgttcgagc aggggtccta ggctgcctgg cactggcagg 198 0 

gctggccgcc gcgctgcccc tggcctcagt gggagaatac ggggcctccc cactctgcct 2040 

gccctacgcg ccacctgagg gtcagccagc agccctgggc ttcaccgtgg ccctggtgat 2100 

gatgaactcc ttctgtttcc tggtcgtggc cggtgcctac atcaaactgt actgtgacct 2160 

gccgcggggc gactttgagg ccgtgtggga ctgcgccatg gtgaggcacg tggcctggct 2220 

catcttcgca gacgggctcc tctactgtcc cgtggccttc ctcagctttg cctccatgct 2280 
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gggcctcttc cctgtcacgc ccgaggccgt caagtctgtc ctgctggtgg tgctgcccct 2340 

gcctgcctgc ctcaacccac tgctgtacct gctcttcaac ccccacttcc gggatgacct 2400 

tcggcggctt cggccccgcg caggggactc agggccccta gcctatgctg cggccgggga 2460 

gctggagaag agctcctgtg attctaccca ggccctggta gccttctctg atgtggatct 252 0 

cattctggaa gcttctgaag ctgggcggcc ccctgggctg gagacctatg gcttcccctc 2580 

agtgaccctc atctcctgtc agcagccagg ggcccccagg ctggagggca gccattgtgt 264 0 

agagccagag gggaaccact ttgggaaccc ccaaccctcc atggatggag aactgctgct 2700 

gagggcagag ggatctacgc cagcaggtgg aggcttgtca gggggtggcg gctttcagcc 2760 

ctctggcttg gcctttgctt cacacgtgta aatatccctc cccattcttc tcttcccctc 2820 

tcttcccttt cctctctccc cctcggtgaa tgatggctgc ttctaaaaca aatacaacca 2880 

aaactcagca gtgtgatcta tagcaggatg gcccagtccc tggctccact gatcacctct 294 0 

ctcctgtgac catcaccaac gggtgcctct tggcctggct ttcccttggc cttcctcagc 3000 

ttcaccttga tactgggcct cttccttgtc atgtctgaag ctgtggacca gagacctgga 3060 

cttttgtctg cttaagggaa atgagggaag taaagacagt gaaggggtgg agggttgatc 3120 

agggcacagt ggacagggag acctcacaga gaaaggcctg gaaggtgatt tcccgtgtga 3180 

ctcatggata ggatacaaaa tgtgttccat gtaccattaa tcttgacata tgccatgcat 3240 

aaagacttcc tattaaaata agctttggaa gag 3373 

<210> 44 

<211> 2192 

<212> DNA 

<213> Homo Sapiens 

<400> 44 

agtctggccc tggacaaccc cagcaaagcc gccctcagcc agcccagaag cactgggcct 60 

tggccacagc aacacccact gagcacgctg ggagctgagt atggcgtccc tggtctcgct 120 

ggagctgggg ctgcttctgg ctgtgctggt ggtgacggcg acggcgtccc cgcctgctgg 180 

tctgctgagc ctgctcacct ctggccaggg cgctctggat caagaggctc tgggcggcct 240 

gttaaatacg ctggcggacc gtgtgcactg caccaacggg ccgtgtggaa agtgcctgtc 300 

tgtggaggac gccctgggcc tgggcgagcc tgaggggtca gggctgcccc cgggcccggt 360 

cctggaggcc aggtacgtcg cccgcctcag tgccgccgcc gtcctgtacc tcagcaaccc 420 

cgagggcacc tgtgaggaca ctcgggctgg cctctgggcc fcctcatgcag accacctcct 480 

ggccctgctc gagagcccca aggccctgac cccgggcctg agctggctgc tgcagaggat 540 

gcaggcccgg gctgccggcc agacccccaa gacggcctgc gtagatatcc ctcagctgct 600 
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ggaggaggcg gtgggggcgg gggctccggg cagtgctggc ggcgtcctgg ctgccctgct 660 

ggaccatgtc aggagcgggt cttgcttcca cgccttgccg agccctcagt acttcgtgga 720 

ctttgtgttc cagcagcaca gcagcgaggt ccctatgacg ctggccgagc tgtcagcctt 780 

gatgcagcgc ctgggggtgg gcagggaggc ccacagtgac cacagtcatc ggcacagggg 840 

agccagcagc cgggaccctg tgcccctcat cagctccagc aacagctcoa gtgtgtggga 900 

cacggtatgc ctgagtgcca gggacgtgat ggctgcatat ggactgtcgg aacaggctgg 960 

ggtgaccccg gaggcctggg cccaactgag ccctgccctg ctccaacagc agctgagtgg 1020 

agcctgcacc tcccagtcca ggccccccgt ccaggaccag ctcagccagt cagagaggta 1080 

tctgtacggc tccctggcca cgctgctcat ctgcctctgc gcggtctttg gcctcctgct n 40 

gctgacctgc actggctgca ggggggtcgc ccactacatc ctgcagacct tcctgagcct 1200 

ggcagtgggt gcactcactg gggacgctgt cctgcatctg acgcccaagg tgctggggct 12 60 

gcatacacac agcgaagagg gcctcagccc acagcccacc tggcgcctcc tggctatgct 1320 

ggccgggctc tacgccttct tcctgtttga gaacctcttc aatctcctgc tgcccaggga 1380 

cccggaggac ctggaggaog ggccctgcgg ccacagcagc catagccacg ggggccacag 1440 

ccacggtgtg tccctgcagc tggcacccag cgagctccgg cagcccaagc ccccccacga 1500 

gggctcccgc gcagacctgg tggcggagga gagcccggag ctgctgaacc ctgagcccag 1560 

gagactgagc ccagagttga ggctactgcc ctatatgatc actctgggcg acgccgtgca 1620 

caacttcgcc gacgggctgg ccgtgggcgc cgccttcgcg tcctcctgga agaccgggct 1680 

ggccacctcg ctggccgtgt tctgccacga gttgccacac gagctggggg acttcgccgc 1740 

cttgctgcac gcggggctgt ccgtgcgcca agcactgctg ctgaacctgg cctccgcgct 1800 

cacggccttc gctggtctct acgtggcact ogoggttgga gtcagcgagg agagcgaggc i860 

ctggatcctg goagtggcca ocggcctgtt cctctacgta gcactctgcg acatgctccc 1920 

ggcgatgttg aaagtacggg acccgcggcc ctggctcctc ttcctgctgc acaacgtggg i 98 o 

cctgctgggc ggctggaccg tccfcgctgct gctgtccctg tacgaggatg acatcacctt 2040 

ctgataccct gccctagtcc cccacctttg acttaagatc ccacacctca caaacctaca 2100 

gcccagaaac cagaagcccc tatagaggcc ccagtcccaa ctccagtaaa gacactcttg 2160 
tccttggaaa aaaaaaaaaa aaaaaaaaaa aa 



<210> 45 

<211> 3014 

<212> DNA 

<213> Homo Sapiens 
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<400> 45 

aagctcgggc tccggcacgt agttgggaaa cttgcgggtc ctagaagtcg cctccccgcc 60 

ttgccggccg cccttgcagc cccgagccga gcagcaaagt gagacattgt gcgcctgcca 120 

gatccgccgg ccgcggaccg gggctgcctc ggaaacacag aggggtcttc tctcgccctg 180 

catataatta gcctgcacac aaagggagca gctgaatgga ggttgtcact ctctggaaaa 24 o 

ggatttctga ccgagcgctt ccaatggaca ttctccagtc tctctggaaa gattctcgct 300 

aatggatttc ctgctgctcg gtctctgtct atactggctg ctgaggaggc cctcgggggt 360 

ggtcttgtgt ctgctggggg cctgctttca gatgctgccc gccgccccca gcgggtgccc 42 0 

gcagctgtgc cggtgcgagg ggcggctgct gtactgcgag gcgctcaacc tcaccgaggc 480 

gccccacaac ctgtccggcc tgctgggctt gtccctgcgc tacaacagcc tctcggagct 540 

gcgcgccggc cagttcacgg ggttaatgca gctcacgtgg ctctatctgg atcacaatca 600 

catctgctcc gtgcaggggg acgcctttca gaaactgcgc cgagttaagg aactcacgct 660 

gagttccaac cagatcaccc aactgcccaa caccaccttc cggcccatgc ccaacctgcg 720 

cagcgtggac ctctcgtaca acaagctgca ggcgctcgcg cccgacctct tccacgggct 7 80 

gcggaagctc accacgctgc atatgcgggc caacgccatc cagtttgtgc ccgtgcgcat 840 

cttccaggac tgccgcagcc tcaagtttct cgacatcgga tacaatcagc tcaagagtct 900 

ggcgcgcaac tctttcgccg gcttgtttaa gctcaccgag ctgcacctcg agcacaacga 960 

cttggtcaag gtgaacttcg cccacttccc gcgcctcatc tccctgcact cgctctgcct 1020 

gcggaggaac aaggtggcca ttgtggtcag ctcgctggac tgggtttgga acctggagaa 1080 

aatggacttg tcgggcaacg agatcgagta catggagccc catgtgttcg agaccgtgcc 1140 

gcacctgcag tccctgcagc tggactccaa ccgcctcacc tacatcgagc cccggatcct 1200 

caactcttgg aagtccctga caagcatcac cctggccggg aacctgtggg attgcgggcg 1260 

caacgtgtgt gccctagcct cgtggctcaa caacttccag gggcgctacg atggcaactt 1320 

gcagtgcgcc agcccggagt acgcacaggg cgaggacgtc ctggacgccg tgtacgcctt 1380 

ccacctgtgc gaggatgggg ccgagcccac cagcggccac ctgctctcgg ccgtcaccaa 1440 

ccgcagtgat ctggggcccc ctgccagctc ggccaccacg ctcgcggacg gcggggaggg 1500 

gcagcacgac ggcacattcg agcctgccac cgtggctctt ccaggcggcg agcacgccga 1560 

gaacgccgtg cagatccaca aggtggtcac gggcaccatg gccctcatct tctccttcct 1620 

catcgtggtc ctggtgctct acgtgtcctg gaagtgtttc ccagccagcc tcaggcagct 1680 

cagacagtgc tttgtcacgc agcgcaggaa gcaaaagcag aaacagacca tgcatcagat 1740 

ggctgccatg tctgcccagg aatactacgt tgattacaaa ccgaaccaca ttgagggagc 1800 
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cctggtgacc atcaacgagt atggctcgtg tacctgccac cagcagcccg cgagggaatg i860 

c 9*ggtgtga ttgtcccagt ggctctcaac ccatgcgcta ccaaatacgc ctgggcagcc 1920 

gggacgggcc ggcgggcacc aggctggggt ctccttgtct gtgctctgat atgctccttg 1B80 

actgaaactt taaggggatc tctcccagag acttgacatt ttagctttat tgtgfccttaa 2040 

aaacaaaagc gaattaaaac acaacaaaaa accccacccc acaaccttca ggacagtcta 2100 

tcttaaattt catatgagaa ctccttcctc cctttgaaga tctgtccata ttcaggaatc 2160 

tgagagtgta aaaaagttct gtcaacagat tcagtcccca gtcctccttc tctgagcaga 2220 

cctctcatcc gcagcatata tctgctagat cttctaagag ctgaaatgga gacttctgga 2280 

agaaggtggg aaagaaatcc atctcggctt aattaaccat ttatcgcatc atattactcc 2340 

catcttaaaa gtgcacgcgt tgtttttctg aaccctcaca caaaggctac tactgtggtc 24 00 

ccatatctgt cggcccatga gaaacagtgt tcttggacct cacagccaag cagcactgaa 2460 

ctgcagcaaa atccagcaac acattcagca gcgagcagcc tgctgagctc cactggttta 2 520 

tccggggcca ccaaccccaa agaactggga tgaaagcaga tgtgagagag gaaaaggatc 2580 

tgtttttgtt ttattttcta ccaggcccag ctctttgtgg ggggaataaa aaagaagaaa 2640 

aatcgagctc caagctggtg ccctgccaag cttcctcccc tcccttccta gtccaagcac 2700 

tccaccgtct gtgcagactg cataacagca atttctggaa acaggctgaa gaatctgggc 2760 

cagtccagag gcagtggatt cctggtttat gtgtggtggg gtfctttagga attttatttt 2820 

tcaccttaat tctttcaaca actgccagct gtttgaagca catctgtaat aaacagcttc 2880 

tgtttgtaaa atgagactga agttatcctc tccagagaaa ttcctgaatc ttctctgtag 2940 

ttcaatgcct tcactgacag tttggctcaa aaagtatgag tgtggtaaat attaaagaat 3 000 

gttaatacaa gtgt 3014 

<210> 46 

<211> 1128 

<212> DNA 

<213> Homo Sapiens 

<400> 46 

atggcgaacg cgagcgagcc gggtggcagc ggcggcggcg aggcggccgc cctgggcctc so 

aagctggcca cgctcagcct gctgctgtgc gtgagcctag cgggcaacgt gctgttcgcg 120 

ctgctgatcg tgcgggagcg cagcctgcac cgcgccccgt actacctgct gctcgacctg 180 

tgcctggccg acgggctgcg cgcgctcgcc tgcctcccgg ccgtcatgct ggcggcgcgg 240 

cgtgcggcgg ccgcggcggg ggcgccgccg ggcgcgctgg gctgcaagct gctcgccttc 300 

ctggccgcgc tcttctgctt ccacgccgcc ttcctgctgc tgggcgtggg cgtcacccgc 360 
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tacctggcca tcgcgcacca ccgcttctat gcagagcgcc tggccggctg gccgtgcgcc 420 

gccatgctgg tgtgcgccgc ctgggcgctg gcgctggccg cggccttccc gccagtgctg 480 

gacggcggtg gcgacgacga ggacgcgccg tgcgccctgg agcagcggcc cgacggcgcc 540 

cccggcgcgc tgggcttcct gctgctgctg gccgtggtgg tgggcgccac gcacctcgtc 600 

tacctccgcc tgctcttctt catccacgac cgccgcaaga tgcggcccgc gcgcctggtg 660 

cccgccgtca gccacgactg gaccttccac ggcccgggcg ccaccggcca ggcggccgcc 720 

aactggacgg cgggcttcgg ccgcgggccc acgccgcccg cgcttgtggg catccggccc 780 

gcagggccgg gccgcggcgc gcgccgcctc ctcgtgctgg aagaattcaa gacggagaag 840 

aggctgtgca agatgttcta cgccgtcacg ctgctcttcc tgctcctctg ggggccctac 9QQ 

gtcgtggcca gctacctgcg ggtcctggtg cggcccggcg ccgtccccca ggcctacctg 960 

acggcctccg tgtggctgac cttcgcgcag gccggcatca accccgtcgt gtgcttcctc 1020 

ttcaacaggg agctgaggga ctgcttcagg gcccagttcc cctgctgcca gagcccccgg 1080 

accacccagg cgacccatcc ctgcgacctg aaaggcattg gtttatga 1128 

<210> 47 

<211> 1736 

<212> DNA 

<213> Homo Sapiens 

<400> 47 

gagggagggg cgggggctgg aggcagcagc gcccccgcac tccccgcgtc tcgcacactt 60 

gcaccggtcg ctcgcgcgca gcccggcgtc gccccacgcc gcgctcgctc ctccctccct 120 

cctcccgctc cgtggctccc gtgctcctgg cgaggctcag gcgcggagcg cgcggacggg 180 

cgcaccgaca gacggccccg gggacgcctc ggctcgcgcc tcccgggcgg gctatgttga 240 

ttgccccgcc ggggccggcc cgcgggatca gcacagcccg gcccgcggcc ccggcggcca 300 

atcgggacta tgaaccggaa agcgcggcgc tgcctgggcc acctctttct cagcctgggc 360 

atggtctacc tccggatcgg tggcttctcc tcagtggtag ctctgggcgc aagcatcatc 420 

tgtaacaaga tcccaggcct ggctcccaga cagcgggcga tctgccagag ccggcccgac 480 

gccatcatcg tcataggaga aggctcacaa atgggcctgg acgagtgtca gtttcagttc 540 

cgcaatggcc gctggaactg ctctgcactg ggagagcgca ccgtcttcgg gaaggagctc 600 

aaagtgggga gccgggaggc tgcgttcacc tacgccatca ttgccgccgg cgtggcccac 660 

gccatcacag ctgcctgtac ccagggcaac ctgagcgact gtggctgcga caaagagaag 720 

caaggccagt accaccggga cgagggctgg aagtggggtg gctgctctgc cgacatccgc 780 

tacggcatcg gcttcgccaa ggtctttgtg gatgcccggg agatcaagca gaatgcccgg 840 
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actctcatga acttgcacaa caacgaggca ggccgaaaga tcctggagga 
ctggaatgta agtgccacgg cgtgtcaggc tcgtgcacca ccaagacgtg 
ctgccacagt ttcgggagct gggctacgtg ctcaaggaca agtacaacga 
gtggagcctg tgcgtgccag ccgcaacaag cggcccacct tcctgaagat 
ctgtcgtacc gcaagcccat ggacacggac ctggtgtaca tcgagaagtc 
tgcgaggagg acccggtgac cggcagtgtg ggcacccagg gccgcgcctg 
gctccccagg ccagcggctg tgacctcatg tgctgtgggc gtggctacaa 
tacgcccgcg tgtggcagtg caactgtaag ttccactggt gctgctatgt 
acgtgcagcg agcgcacgga gatgtacacg tgcaagtgag ccccgtgtgc 
cccgctgcaa gtcagattgc tgggaggact ggaccgtttc caagctgcgg 
aggatgctga gcttgtcttt tctgctgagg agggtacttt tcctgggttt 
tccgtggggg aaaaaaaatc tctcagagcc ctcaactatt ctgttccaca 
ctccaccctc ccccagacac agcccaggtc cctccgcggc tggagcgaag 
caggaactct ggacccctgg gcctcatcac agcaatattt aacaatttat 
ataatattaa tttatttaat taaaaagaat tcttccacaa aaaaaaaaaa 

<210> 48 

<211> 3195 

<212> DNA 

<213> Homo Sapiens 

<400> 48 

acagcatgga gtggggttac ctgttggaag tgacctcgct gctggccgcc 
tgcagcgctc tagcggcgct gcggccgcct cggccaagga gctggcatgc 
ccgtgccgct gtgtaagggc atcggctaca actacaccta catgcccaat 
acgacacgca agacgaggcg ggcctggagg tgcaccagtt ctggccgctg 
agtgctcgcc cgatctcaag ttcttcctgt gcagcatgta cacgcccatc 
actacaagaa gccgctgccg ccctgccgct cggtgtgcga gcgcgccaag 
cgccgctcat gcgccagtac ggcttcgcct ggcccgaccg catgcgctgc 
ccgagcaagg caaccctgac acgctgtgca tggactacaa ccgcaccgac 
ccgcgcccag cccgccgcgc cgcctgccgc cgccgccgcc cggcgagcag 
gcagcggcca cggccgcccg ccgggggcca ggcccccgca ccgcggaggc 
gtggcggcgg ggacgcggcg gcgcccccag ctcgcggcgg cggcggtggc 
ggccccctgg cggcggcgcg gctccctgcg agcccgggtg ccagtgccgc 
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gaacatgaag 

ctggaccaca 

ggccgttcac 

caagaagcca 

gcccaactac 

caacaagacg 

cacccaccag 

caagtgcaac 

acaccaccct 

gctccctggc 

cctgcaggca 

cccaatgctg 

ccttctgcag 

tctgataaaa 

aaaaaa 



ttggcgctgc 
caagagatca 
cagttcaacc 
gtggagatcc 
tgcctagagg 
gccggctgcg 
gaccggctgc 
ctaaccaccg 
ccgccttcgg 
ggcaggggcg 
gggaaggcgc 
gcgcctatgg 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1736 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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tgagcgtgtc cagcgagcgc cacccgctct acaaccgcgt caagacaggc cagatcgcta 780 

actgcgcgct gccctgccac aacccctttt tcagccagga cgagcgcgcc ttcaccgtct 840 

tctggatcgg cctgtggtcg gtgctctgct tcgtgtccac cttcgccacc gtctccacct 900 

fccctfcatcga catggagcgc ttcaagtacc cggagcggcc cattatcttc ctctcggcct 960 

gctacctctt cgtgtcggtg ggctacctag tgcgcctggt ggcgggccac gagaaggtgg 1020 

cgtgcagcgg tggcgcgccg ggcgcggggg gcgctggggg cgcgggcggc gcggcggcgg 1080 

gcgcgggcgc ggcgggcgcg ggcgcgggcg gcccgggcgg gcgcggcgag tacgaggagc 1140 

tgggcgcggt ggagcagcac gtgcgctacg agaccaccgg ccccgcgctg tgcaccgtgg 1200 

tcttcttgct ggtctacttc ttcggcatgg ccagctccat ctggtgggtg atcttgtcgc 1260 

tcacatggtt cctggcggcc ggtatgaagt ggggcaacga agccatcgcc ggctactcgc 1320 

agtacttcca cctggccgcg tggcttgtgc ccagcgtcaa gtccatcgcg gtgctggcgc 1380 

tcagctcggt ggacggcgac ccggtggcgg gcatctgcta cgtgggcaac cagagcctgg 1440 

acaacctgcg cggcttcgtg ctggcgccgc tggtcatcta cctcttcatc ggcaccatgt 1500 

tcctgctggc cggcttcgtg tccctgttcc gcatccgctc ggtcatcaag caacaggacg 1560 

gccccaccaa gacgcacaag ctggagaagc tgatgatccg cctgggcctg fctcaccgtgc 1620 

tctacaccgt gcccgccgcg gtggtggtcg cctgcctctt ctacgagcag cacaaccgcc 1680 

cgcgctggga ggccacgcac aactgcccgt gcctgcggga cctgcagccc gaccaggcac 1740 

gcaggcccga ctacgccgtc ttcatgctca agtacttcat gtgcctagtg gtgggcatca 1800 

cctcgggcgt gtgggtctgg tccggcaaga cgctggagtc ctggcgctcc ctgtgcaccc 1860 

gctgctgctg ggccagcaag ggcgccgcgg tgggcggggg cgcgggcgcc acggccgcgg 1920 

ggggtggcgg cgggccgggg ggcggcggcg gcgggggacc cggcggcggc ggggggccgg 1980 

gcggcggcgg gggctccctc tacagcgacg tcagcactgg cctgacgtgg cggtcgggca 204 0 

cggcgagctc cgtgtcttat ccaaagcaga tgccattgtc ccaggtctga gcggagggga 2100 

gggggcgccc aggaggggtg gggagggggg cgaggagacc caagtgcagc gaagggacac 2160 

ttgatgggct gaggttccca ccccttcaca gtgttgattg ctattagcat gataatgaac 2220 

tcttaatggt atccattagc tgggacttaa atgactcact tagaacaaag tacctggcat 2280 

tgaagcctcc cagacccagc cccttttccc ccattgatgt gcggggagct cctcccgcca 2340 

cgcgttaatt tctgttggct gaggagggtg gactctgcgg cgtttccaga acccgagatt 2400 

tggagccctc cctggctgca cttggctggg tttgcagtca gatacacaga tttcacctgg 2460 

gagaacctct ttttctccct cgactcttcc tacgtaaact cccacccctg acttaccctg 2520 

gaggaggggt gaccgccacc tgatgggatt gcacggtttg ggtattctta atgaccaggc 2580 
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aaatgcctta agtaaacaaa caagaaatgt cttaattata caccccacgt aaatacgggt 2640 

ttcttacatt agaggatgta tttatataat tatttgttaa attgtaaaaa aaaaaagtgt 2700 

aaaatatgta tatatccaaa gatatagtgt gtacattttt ttgtaaaaag tttagaggct 2760 

tacccctgta agaacagata taagtattct attttgtcaa taaaatgact tttgataaat 2820 

gatttaacca ttgccctctc ccccgcctct tctgagctgt cacctttaaa gtgcttgcta 2880 

aggacgcatg gggaaaatgg acattttctg gcttgtcatt ctgtacactg accttaggca 2940 

tggagaaaat tacttgttaa actctagttc ttaagttgtt agccaagtaa atatcattgt 3000 

tgaactgaaa tcaaaattga gtttttgcac cttccccaaa gacggtgttt ttcatgggag 3060 

ctcttttctg atccatggat aacaactctc actttagtgg atgtaaatgg aacttctgca 3X20 

aggcagtaat tccccttagg ccttgttatt tatcctgcat ggtatcacta aaggtttcaa 3180 

aaccctgaaa aaaaa , lpc . 



<210> 49 
<211> 1380 
<212> DNA 

<213> Homo Sapiens < 




<400> 49 
ccgggtcgga 


gccccccgga gctgcgcgcg ggctfcgcagc gcctcgcccg cgctgtcctc 


60 


ccggtgtccc 


gcttctccgc gccccagccg ccggctgcca gcttttcggg gccccgagtc 


120 


gcacccagcg 


aagagagcgg gcccgggaca agctcgaact ccggccgcct cgcccttccc 


180 


cggctccgct 


ccctctgccc cctcggggtc gcgcgcccac gatgctgcag ggccctggct 


240 


cgctgctgct 


gctcttcctc gcctcgcact gctgcctggg ctcggcgcgc gggctcttcc 


300 


tctttggcca 


gcccgacttc tcctacaagc gcagcaattg caagcccatc cctgccaacc 


360 


tgeagctgtg 


ccacggcatc gaataccaga acatgcggct gcccaacctg ctgggccacg 


420 


agaccatgaa 


ggaggtgctg gagcaggccg gcgcttggat cccgctggtc atgaagcagt 


480 


gccacccgga 


caccaagaag ttcctgtgct cgctcttcgc ccccgtctgc ctcgatgacc 


540 


tagacgagac 


catccagcca tgccactcgc tctgcgtgca ggtgaaggac cgctgcgccc 


600 


cggtcatgtc 


cgccttcggc ttcccctggc ccgacatgct tgagtgcgac cgtttccccc 


660 


aggacaacga 


cctttgcafcc cccctcgcta gcagcgacca cctcctgcca gccaccgagg 


720 


aagctccaaa 


ggtatgtgaa gcctgcaaaa ataaaaatga tgatgacaac gacataatgg 


780 


aaacgctttg 


taaaaatgat tttgcactga aaataaaagt gaaggagata acctacatca 


840 


accgagatac 


caaaatcatc ctggagacca agagcaagac catttacaag ctgaacggtg 


900 


tgtccgaaag 


ggacctgaag aaatcggtgc tgtggctcaa agacagcttg cagtgcacct 


960 



WO 2004/073657 PCT/US2004/005455 

87/282 

gtgaggagat gaacgacatc aacgcgccct atctggtcat gggacagaaa cagggtgggg 1020 

agctggtgat cacctcggtg aagcggtggc agaaggggca gagagagttc aagcgcatct 1080 

cccgcagcat ccgcaagctg cagtgctagt cccggcatcc tgatggctcc gacaggcctg H40 

ctccagagca cggctgacca tttctgctcc gggatctcag ctcccgttcc ccaagcacac 1200 

tcctagctgc tccagtctca gcctgggcag cttccccctg ccttttgcac gtttgcatcc 1260 

ccagcatttc ctgagttata aggccacagg agtggatagc tgttttcacc taaaggaaaa 1320 

gcccacccga atcttgtaga aatattcaaa ctaataaaat catgaatatt tttatgaagt 1380 

<210> 50 

<211> 2573 

<212> DNA 

<213> Homo Sapiens 

<400> 50 

gaggagggac ctacaaagac tggaaactat tcttagctcc gtcactgact ccaagttcat 60 

cccctctgtc tttcagtttg gttgagatat aggctactct tcccaactca gtcttgaaga 120 

gtatcaccaa ctgcctcatg tgtggtgacc ttcactgtcg tatgccagtg actcatctgg 180 

agtaatctca acaacgagtt accaatactt gctcttgatt gataaacaga atggggtttt 240 

ggatcttagc aattctcaca attctcatgt attccacagc agcaaagttt agtaaacaat 300 

catggggcct ggaaaatgag gctttaattg taagatgtcc tagacaagga aaaccfcagtt 360 

acaccgtgga ttggtattac tcacaaacaa acaaaagtat tcccactcag gaaagaaatc 420 

gtgtgtttgc ctcaggccaa cttctgaagt ttctaccagc tgcagttgct gattctggta 480 

tttatacctg tattgtcaga agtcccacat tcaataggac tggatatgcg aatgt caeca 540 

tatataaaaa acaatcagat tgcaatgttc cagattattt gatgtattca acagtatctg 600 

gatcagaaaa aaattccaaa atttattgtc ctaccattga cctctacaac tggacagcac 660 

ctcttgagtg gtttaagaat tgtcaggctc ttcaaggatc aaggtacagg gcgcacaagt 720 

catttttggt cattgataat gtgatgactg aggaegcagg tgattacacc tgtaaattta 780 

tacacaatga aaatggagcc aattatagtg tgaeggegae caggtccttc aeggtcaagg 840 

atgagcaagg cttttctctg tttccagtaa tcggagcccc tgcacaaaat gaaataaagg 900 

aagtggaaat tggaaaaaac gcaaacctaa ctfcgctctgc ttgttttgga aaaggcactc 960 

agttcttggc tgccgtcctg tggcagctta atggaacaaa aattacagac tttggtgaac 1020 

caagaattca acaagaggaa gggcaaaatc aaagtttcag caatgggctg gcttgtctag 1080 

acatggtttt aagaatagct gaegtgaagg aagaggattt attgetgeag tacgactgtc 1140 

tggccctgaa tttgcatggc ttgagaaggc acacegtaag actaagtagg aaaaatccaa 1200 
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gtaaggagtg 


fcttctgagac 


tttgatcacc 


tgaactttct 


ctagcaagtg 


taagcagaat 


1260 


ggagtgtggt 


tccaagagat 


ccatcaagac 


aatgggaatg 


gcctgtgcca 


taaaatgtgc 


1320 


ttctcttctt 


cgggatgttg 


tttgctgtct 


gatctttgta 


gactgttcct 


gtttgctggg 


1380 


agcttctctg 


ctgcttaaat 


tgttcgtcct 


cccccactcc 


ctcctatcgt 


tggtttgtct 


1440 


agaacactca 


gctgcttctt 


tggtcatcct 


tgttttctaa 


ctttatgaac 


tccctctgtg 


1500 


tcactgtatg 


tgaaaggaaa 


tgcaccaaca 


accgtaaact 


gaacgtgttc 


ttttgtgctc 


1560 


ttttataact 


tgcattacat 


qttqtaaaca 


tggtccgttc 


tatacctttt 


tctggtcata 


1620 


atgaacactc 


attttgttag 


cqaqqqtqqt 


aaagtgaaca 


aaaaggggaa 


gtatcaaact 


1680 


actgccattt 


cagtgagaaa 


atcctaggtg 


ctactttata 


ataagacatt 


tqttaqqcca 


1740 


ttcttgcatt 


gatataaaga 


aatacctgag 


actgggtgat 


ttatatgaaa 


agaggtttaa 


1800 


ttggctcaca 


gttctgcagg 


ctgtatggga 


agcatggcgg 


catctgcttc 


tggggacacc 


1860 


tcaggagctt 


tactcatggc 


agaaggcaaa 


gcaaaggcag 


gcacttcaca 


cagtaaaagc 


1920 


aggagcgaga 


gagaggtgcc 


acactgaaac 


agccagatct 


catgagaagt 


cactcactat 


1980 


tgcaaggaca 


gcatcaaaga 


gatggtgcta 


aaccattcat 


gatgaactca 


cccccatgat 


2040 


ccaatcacct 


cccaccaggc 


tccacctcga 


atactgggga 


ttaccattca 


gcatgagatt 


2100 


tgggcaggaa 


cacagaccca 


aaccatacca 


cacacattat 


cattgttaaa 


ctttgtaaag 


2160 


tatttaaggt 


acatggaaca 


cacqqqaaqt 


ctggtagctc 


agcccatttc 


fcttafctcxcafc 


2220 


ctgttattca 


ccatgtaatt 


caggtaccac 


gtafctccagg 


gagocfcttct 


tggcccfccag 


2280 


tttgcagtat 


acacactttc 


caagtactct 


tgtagcatcc 


tgtttgtatc 


atagcactgg 


2340 


tcacattgcc 


ttacctaaat 


ctgtttgaca 


gtctgctcaa 


cacgactgca 


agctccatga 


2400 


gggcagggac 


atcatctctt 


ccatctttgg 


gtccttagtg 


caatacctgg 


cagctagcca 


2460 


gtgctcagct 


aaatatttgt 


tgactgaata 


aatgaatgca 


caaccaaaaa 


aaaaaaaaaa 


2520 


aaaaaaaaaa 


aaaaaaaaaa 


aataaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 


2573 



<210> 51 

<211? 803 

<212> DNA 

<213> Homo Sapiens 

<400> 51 



ccggcacgag aggagttgtg 


agtttccaag 


ccccagctca 


ctctgaccac 


ttctctgcct 


60 


gcceagcatc 


atgaagggcc 


ttgcagctgc 


cctccttgtc 


ctcgtctgca 


ccatggccct 


120 


ctgotcctgt 


gcacaagttg 


gtaccaacaa 


agagctctgc 


tgcctcgtct 


atacctcctg 


180 


gcagattcca 


caaaagttca 


tagttgacta 


ttctgaaacc 


agcccccagt 


gccccaagcc 


240 
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aggtgtcatc 


ctcctaacca 


agagaggccg 


gcagatctgt 


gctgacccca 


ataagaagtg 


300 


ggtccagaaa 


tacatcagcg 


acctgaagct 


gaatgcctga 


ggggcctgga 


agctgcgagg 


360 


gcccagtgaa 


cttggtgggc 


ccaggaggga 


acaggagcct 


gagccagggc 


aatggccctg 


420 


ccaccctgga 


ggccacctct 


tctaagagtc 


ccatctgcta 


tgcccagcca 


cattaactaa 


480 


ctttaatctt 


agtttatgca 


tcatatttca 


ttttgaaatt 


gatttctatt 


gttgagctgc 


540 




cagcancc tc 


tctgacatct 


catgacattg 


tctxcatcat 


cctttcccct 


600 


ttcccttcaa 


ctcttcgtac 


attcaatgca 


tggatcaatc 


agtgtgatta 


gctttctcag 


660 


cagacattgt 


gccatatgta 


tcaaatgaca 


aatctttatt 


gaatggtttt 


gctcagcacc 


720 


accttttaat 


atattggcag 


tacttattat 


ataaaaggta 


aaccagcatt 


ctcactgtga 


780 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 








803 



<210> 52 

<211> 5855' 

<212> DNA 

<213> Homo Sapiens 

<400> 52 



cacgcagtcc 


cggcccgagc 


cgacgccttg 


caggagggtt 


caaatccgcg 


cgggggagct 


60 


gcgacgcgca 


agggctgcgg 


agccgcgggc 


cggcgagcgc 


gtcgccacca 


tgaagcagct 


120 


gcctgtaccc 


tgttgaaact 


tcatggccac 


agccccaggc 


cctgctggca 


ttgccatggg 


180 


cagcgtgggc 


agcctgttgg 


aacggcagga 


cttctcccct 


gaagagctgc 


gggcggcact 


240 


tgccgggtct 


cggggctccc 


gccagcctga 


tgggctcctc 


cggaagggct 


tgggccagcg 


300 


tgagttcctc 


agctacctgc 


acctccccaa 


gaaggacagc 


aagagcacca 


agaacaccaa 


360 


gcgggcccct 


cggaacgagc 


ctgccgacta 


tgccaccctc 


tactaccggg 


aacattctcg 


420 


cgcgggtgac 


ttcagcaaga 


cctcgctgcc 


agaacggggt 


cgctttgaca 


agtgccgcat 


480 


tcgcccctca 


gtgttcaagc 


ctacggcggg 


caacgggaaa 


ggcttcctat 


ccatgcaaag 


540 


cctggcgtcc 


cacaaaggcc 


agaagctgtg 


gcgcagcaat 


ggcagcctgc 


acacgctggc 


600 


ctgccacccg 


cccctgagcc 


ccgggccccg 


ggccagccag 


gcccgggcac 


agctgctgca 


660 


cgccctcagc 


ctagatgagg 


gcggccctga 


gcccgagccc 


agcctgtccg 


actcctccag 


720 


tgggggtagt 


tttggtcgca 


gtcctggtac 


tggccctagc 


cccttcagct 


cctcccttgg 


780 


ccaccttaac 


cacctcgggg 


gctccctgga 


ccgggcctct 


caaggaccca 


aggaggctgg 


840 


gccaccagct 


gtgctgagct 


gcctgcccga 


gccaccaccc 


ccctacgagt 


tctcctgctc 


900 


ctctgccgag 


gaaatgggag 


ccgtgctgcc 


cgagacctgt 


gaggagctca 


agaggggcct 


960 


tggcgatgag 


gacggctcca 


accccttcac 


gcaggtgctg 


gaggagcgcc 


agcggctgtg 


1020 
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gctggctgag ctgaagcgcc tgtatgtgga gcggctgcac gaggtgaccc agaaggctga 1080 

gcgcagcgag cgcaacctcc agctgcagct gtttatggct cagcaggagc agcggcgcct H40 

gcgcaaggag ctgcgggctc agcagggcct ggctccggag cctcgggccc ccggcaccct 12 00 

cccagaggct gaccccagtg cacgaccaga ggaggaagcc cgatgggagg tgtgccagaa 1260 

gacagcagag attagcctct tgaagcagca gctgcgtgaa gcccaggcgg aactggccca 1320 

gaagctggcg gagatcttca gtctgaagac acaacttcgg ggcagccggg cacaagccca 1380 

ggctcaggac gcagagctgg tccggctgcg cgaggctgtg cgcagcctgc aggagcaggc 144 0 

ccctcgggag gaagccccag gcagctgtga gactgatgac tgcaagagca ggggcctgct 1500 

aggggaggca ggaggcagcg aggccagaga cagtgctgag cagctgcggg ctgagctgct 1560 

gcaggagcga cttcggggcc aggagcaggc gctgcgcttt gagcaggagc ggcggacttg 1620 

gcaggaggag aaggagcgcg tgctgcgcta ccagcgggag atccagggag ggtacatgga 1680 

catgtaccgc cgcaaccagg cactggagca ggaactgcgg gcactgcggg agccccccac 1740 

accctggagt ccccggctcg agtcctccaa gatctgaggc cagcagagcg agctgacagc 1800 

agcaacactg tcagaaggtg ccctgagacg gccggctcag ccttcccttg cactggttgg 1860 

ggtggaacct gcagaggcca gcccggggct ggggaggcgc aaggagagga gggatccagt 1920 

ggggccgtgg gctgggtagg gtgccttggc aggagccagg acaaggccct cctggcagag 1980 

gagcacctag gcagggccca gccctgcttc ctggagtgga tgtggcccag agaaggaggc 2040 

tgggggatca ccagccccaa ggtcccgaag ggcaggtcag agggagagag gctggagacc 2100 

tgggctggag ccttcctcca gggaaggagg ctggggtggg aacactggcc tcccccagaa 2160 

taaaaccatg ttttctacca gaggctcaga atacgctgag cctgtgacca gaggatgatg 2220 

gatggtcggg attgaggctg ttgacctggg cagtagctcc tcccatggcc agtggtcagt 2280 

gggagggtgt gccctgcgcc tgtctgcatg gccactgggc atgtgtgttg ggagcagagg 234 0 

agtcctactc cttgcctcag ccccacacgg ttccttagct gccgtgtggg ctgaaattcc 2400 

tttctttagc accagtggag ttttcagaag gaacaacacc agggaatgcc aaaaaaacaa 2460 

agggcaagtc aaccaaggca ttttgagaac atgaaatgtc ttcttggtgg gaaggctggg 2520 

ccccaggagt atccacaggc tcaggccatg cccctccccg cacccacctc ctgtctgcct 2580 

ccctccctcc ataggagtgg ggggccccta gaagtgctct gcctcattcc ctgttcgtag 2640 

gaaggtgcag gagaggaggt gggcaagctt atgggtttct tggtaaagag cctttaccac 2700 

agcaaggaat gggaacattt ccccatcagc aacggggctc tagggcatta ttaagtaggg 2760 

gtgaaatatg attgatttgc attctggaaa gctctcccag gaggaagcat tccccacccc 2820 
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accttgcctc 


tgtcctgcgc 


tgggctccag 


gacggtcagt 


cctcccagat 


ccctgctctc 


2880 


agctaaggct 


ggtccctaaa 


acccacacct 


gcctttggtc 


tctcagaggc 


ctggcttgcc 


2940 


ctctgggcct 


ttgctccccc 


gctgggtgcc 


cggcctggaa 


cacaggttct 


gtctggctgt 


3000 


gtgagtggcg 


tctctccctt 


cctttgaggg 


aagctcagct 


ttcttacgcg 


ctcatcgttg 


3060 


gacgggctgc 


catgcattgc 


gtgcttactg 


ggcgccaggt 


gctgtgctgg 


gcccttgtga 


3120 


gtgtgggggc 


acaggctgaa 


ttagatgagc 


tcctgccgca 


gggggccctg 


cacactgatg 


3180 


tcagctcacc 


agtctcttcc 


ctgacagggc 


agctcaggga 


acttctgaga 


agttatctgc 


3240 


agagagcccc 


ccctcactcc 


ctttggctac 


cctggccgtg 


cctggcagcc 


cctcaggccc 


3300 


cacaggtggg 


tcctggttgt 


gaagtggagt 


tagtgcacat 


ggatgtcagg 


ggacgggtgt 


3360 


gaggcatgag 


ctgggggggc 


cccctttttc 


agggcagaga 


atgaggcgtg 


tgcactgtct 


3420 


gctgtgggga 


tggtgtgttt 


atggccatgt 


aagtggcagg 


tgtgtgtaaa 


gatgtgcttg 


3480 


tgggtggtgg 


gtgtgtctga 


gggggtgtgt 


ttgggagcct 


ggcctccaaa 


agccagtgtc 


3540 


tgggttggag 


acagtgcagg 


ccccttcttg 


gccccaccct 


ctaccccatt 


ccttcctgtc 


3600 


agccccagtc 


tgtcttggca 


ctggccaggc 


ttgggtggag 


cctgtgtccc 


agacagatcc 


3660 


cctaggccta 


tctgagaccc 


atgcaggccc 


acacctgagc 


tcctgtctgc 


ctgaggaggg 


3720 


aaagggggag 


tggccaggtc 


aggcaggcca 


gtcgtctaca 


cacccccttg 


gtacccattc 


3780 


ccatttcaca 


aactcctcca 


aaccccaaac 


cccaaacgga 


aatctctggg 


acaatggctc 


3840 


ctccagaggg 


cccctcagag 


ccaggtgctg 


gccagatgcc 


ttgatcacag 


cctccatggc 


3900 


caggggcctg 


tcaagccagg 


ggcccacata 


cctggtgcca 


gggcccacct 


cggtatacca 


3960 


gctcggcaga 


acagttcccc 


acccagcgat 


tagggagctg 


ggctggccag 


tacagccagt 


4020 


ggggagtggg 


tgggaggcac 


caaaacttgg 


cagctgtgcc 


agaaaccaaa 


gccaggccca 


4080 


gatgtagggg 


gagggcacga 


gagctgaacc 


tgagcctggg 


gcgtctcgga 


gctggatccc 


4140 


cgtcagctca 


tccaggctgg 


acaaaccagt 


tcctgggtcc 


cccagccggc 


tgggtggctc 


4200 


tggggccaac 


cgaccacagg 


tgtcactgat 


cacaggtgtc 


accgtagtgg 


ctaccacctc 


4260 


agggcagccc 


ctccctccag 


ctggcggggc 


ctcagcttcc 


cagctgccct 


ggcttagcct 


4320 


tccctgcagc 


ccctccccaa 


caggcttggt 


ccgggcagcc 


tgctctgcgg 


aaggtggcag 


4380 


cctcctggtg 


ctgggctccc 


tgcctctggg 


gcctccagtc 


cttcactcag 


tgtcatccct 


4440 


tccagcagct 


ccctgggcct 


cttggtccafc 


cccttcaggc 


ttctgacaca 


ctagtggcct 


4500 


cgagtcactg 


atagtggtgc 


ctcctcctta 


ccttggcaag 


cccttgtagc 


cctctgagcc 


4560 


ctcatcctgc 


aggcagagcc 


cctcagtgct 


ctgagaaacc 


tgccctgaac 


tcaccgcctc 


4620 


ccctgcctgg 


gcactccctc 


ctccctgcgc 


ccacagcccc 


catgccaatc 


tccgtcagct 


4680 
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ctttctgtgc 


tgctgtaatt 


gttggtttgc 


ctgtctcctt tgagctgtgg gcattgccgg 


4740 


ggtagggtgg 


tgtcttcacc 


acagctccag 


aaggcactgg tgggatgtgg agtcaggagc 


4800 


aaccagagac 


cccccaatac 


tagaatgggt 


ttgagcttgg ggtggggtgg atggggaaga 


4860 


cttactctga 


atgttcgttc 


accatccgtg 


ggcaccaggt ctgtgccaag cacagggact 


4920 


ccaggggcag 


ctgccattcg 


ttccagtgat 


gtatttgggg ccttctcagg tgaggaagcc 


4980 


aaggtggcca 


aggccctcgg 


tcccctgccc 


tgatagggct tacctgtggg tgggggaggg 


5040 


agaccacaca 


tgtcagaggt 


agtgaaggcc 


gctccaaata gtgactgggg ccaggcaggc 


5100 


ggaaaagaga 


acagaagaga 


atccagaagt 


gttcagacag aactagggac aacagagggg 


5160 


cctccatggt 


ggcatgggtc 


agcagagcat 


ggagcaggca gaggaagcct atctgtgggg 


5220 


ctgggtacat 


ttctctgaga 


ctcacggaat 


gtaagtgttg aggtttctgc aaagagggcg 


5280 


caggccctgc 


agtgacggtt 


cagatattga 


taacccatcc cccttggcag gtgggggctt 


5340 


aggcatccat 


attggattca 


gaaaagaaaa 


agaatgatgc ttcttgcaac aaaaacaaaa 


5400 


gtgattatgt 


cacgagagga 


gaggctggag 


ggccatggga gttcccagct ggctcactca 


5460 


tccgtccagg 


aggtgggtct 


catggtttca 


ttttgctttg ggcttactta gagttttggc 


5520 


aaatgcagag 


ttatgtaacg 


aacccaacaa 


tgaagataca gttctgtcac ccccccagca 


5580 


ttctcctgtg 


gccctctagt 


ccaccccccg 


ccccacccca gcccctggca accactgatc 


5640 


tgctttcaca 


ataaccgtat 


ggggtttttt 


ctctttcctt ttttgtgagc ggtatttaat 


5700 


tcctttgctc 


taatatgtaa 


tattagaaca 


tattacaaag tagcaagaaa aatacgacat 


5760 


tggtgtaaaa 


ataatctaca 


cctcatggaa 


agaatgaaga gccagaaata gattctattc 


5820 


tgtagaaaat 


cttacacaat 


aaaggacttc 


aaaat 


5855 


<210> 53 

<211> 2022 

<212> DNA 

<213> Homo Sapiens 








<400> 53 
atgcctggcc 


agaagttctt 


cctagaggtt 


ctatgctgtc ctagcaagaa ttggcgatcg 


60 


agcgccgcgg 


agcgcgtccc 


tccctcgcca 


atccggctcc ggcgccggcg cccgcccgcg 


120 


ttttcccggc gcctgccgct 


ccgccgctcc 


gacccggcac gcagtcccgg cccgagccga 


180 


cgccttgcag gagggttcaa 


atccgcgcgg 


gggagctgcg acgcgcaagg gctgcggagc 


240 


cgcgggccgg cgagcgcgtc 


gccaccatgg 


gcagctgtgt cttccatctc caccaaggat 


300 


tggtccgaaa 


gcaattcctc 


tccgtgttca 


gagattccag tgctgcctgc taatcttggg 


360 


gactggagag ggatttggtg 


gggaacatgg 


caggaggccc caggccctgc tggcattgcc 


420 
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atgggcagcg 


tgggcagcct 


gttggaacgg 


caggacttct 


cccctgaaga 


gctgcgggcg 


480 


gcacttgccg 


ggtctcgggg 


ctcccgccag 


cctgatgggc 


tcctccggaa 


gggcttgggc 


540 


cagcgtgagt 


tcctcagcta 


cctgcacctc 


cccaagaagg 


acagcaagag 


caccaagaac 


600 


accaagcggg 


cccctcggaa 


cgagcctgcc 


gactatgcca 


ccctctacta 


ccgggaacat 


660 


tctcgcgcgg 


gtgacttcag 


caagacctcg 


ctgccagaac 


qqggtcgctt 


tgacaagtgc 


720 


cgcattcgcc 


cctcagtgtt 


caagcctacg 


gcgggcaacq 


qqaaaqqctt 


cctatccatg 


780 


caaagcctgg 


cgtcccacaa 


aggccagaag 


ctgtggcgca 


gcaatggcag 


cctgcacacg 


840 


ctggcctgcc 


acccgcccct 


gagccccggg 


ccccgggcca 


qccaqqcccq 


qqcacaqctq 


900 


ctgcacgccc 


tcagcctaga 


tgagggcggc 


cctgagcccg 


agcccagcct 


gtccgactcc 


960 


tccagtgggg 


■gtagttttgg 


tcgcagtcct 


ggtactggcc 


ctagcccctt 


cagctcctcc 


1020 


cttggccacc 


ttaaccacct 


cgggggctcc 


ctggaccggg 


cctctcaagg 


acccaaggag 


1080 


gctgggccac 


cagctgtgct 


gagctgcctg 


cccgagccac 


caccccccta 


cgagttctcc 


1140 


tgctcctctg 


ccgaggaaat 


gggagccgtg 


ctgcccgaga 


cctgtgagga 


gctcaagagg 


1200 


ggccttggcg 


atgaggacgg 


ctccaacccc 


ttcacgcagg 


tgctggagga 


gcgccagcgg 


1260 


ctgtggctgg 


ctgagctgaa 


gcgcctgtat 


gtggagcggc 


tqcacgaggt 


gacccagaag 


1320 


gctgagcgca 


gcgagcgcaa 


cctccagctg 


cagctgttta 


tggctcagca 


aqaqcaqcqq 


1380 


cgcctgcgca 


aggagctgcg 


ggctcaqcag 


qqcctqqctc 


cqqaqcctcq 


qqcccccqqc 

3 3 ^3 3 ^ 


1440 


accctcccag 


aggctgaccc 


cagtgcacga 


ccaqaqqaqq 


aaqcccqatcr 


qqaqqtofcoc 

533**33 w 3 k 'y v ' 


1500 


cagaagacag 


cagagattag 


cctcttgaag 


caqcaqctqc 


qtqaaqccca 


aa ccrcraac t cr 

33 ^-3 3 ^C*-*- '-3 


1560 


gcccagaagc 


tggcggagat 


cttcagtctg 


aagacacaac 


ttcqqqqcaq 


ccQaacacaa 


* 1620 


gcccaggctc 


aggacgcaga 


gctggtccgg 


ctgcgcgagg 


ctgtqcqcaq 

-3 3 3 3 


cctgcaggag 


1680 


caggcccctc 


gggaggaagc 


cccaggcagc 


tgtgagactg 


atgactgcaa 


qaqcaqqqqc 

3 "3 W **3P33^ 


1740 


ctgctagggg 


aggcaggagg 


cagcgaggcc 


agagacagtg 


ctgagcagct 


gcgggctgag 


1800 


ctgctgcagg 


agcgacttcg 


gggccaggag 


caggcgctgc 


gctttgagca 


ggagcggcgg 


1860 


acttggcagg 


aggagaagga 


gcgcgtgctg 


cgctaccagc 


gggagafccca 


gggagggtac 


1520 


atggacatgt 


accgccgcaa 


ccaggcactg 


gagcaggaac 


tgcgggcact 


gcgggagccc 


1580 


cccacaccct 


ggagtccccg 


gctcgagtcc 


tccaagatct 


ga 




2022 



<210> 54 

<211> 3805 

<212> DNA 

<213> Homo Sapiens 
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<400> 54 

ggctcggctc 


ctagagctgc 


cacggccatg 


gccagagccc 


gcccgccgcc 


gccgccgtcg 


60 


ccgccgccgg 


ggcttctgcc 


gctgctccct 


ccgctgctgc 


tgctgccgct 


gctgctgctg 


120 


cccgccggct 


gccgggcgct 


ggaagagacc 


ctcatggaca 


caaaatgggt 


aacatctgag 


180 


ttggcgtgga 


catctcatcc 


agaaagtggg 


tgggaagagg 


tgagtggcta 


cgatgaggcc 


240 


atgaatccca 


tccgcacata 


ccaggtgtgt 


aatgtgcgcg 


agtcaagcca 


gaacaactgg 


300 


cttcgcacgg 


ggttcatctg 


gcggcgggat 


gtgcagcggg 


tctacgtgga 


gctcaagttc 


360 


actgtgcgtg 


actgcaacag 


catccccaac 


atccccggct 


cctgcaagga 


gaccttcaac 


420 


ctcttctact 


acgaggctga 


cagcgatgtg 


gcctcagcct 


cctccccctt 


ctggatggag 


480 


aacccctacg 


tgaaagtgga 


caccattgca 


cccgatgaga 


gcttctcgcg 


gctggatgcc 


540 


ggccgtgtca 


acaccaaggt 


gcgcagcttt 


gggccacttt 


ccaaggctgg 


cttctacctg 


600 


gccttccagg 


accagggcgc 


ctgcatgtcg 


ctcatctccg 


tgcgcgcctt 


ctacaagaag 


660 


tgtgcatcca 


ccaccgcagg 


cttcgcactc 


ttccccgaga 


ccctcactgg 


ggcggagccc 


720 


acctcgctgg 


tcattgctcc 


tggcacctgc 


atccctaacg 


ccgtggaggt 


gtcggtgcca 


780 


ctcaagctct 


actgcaacgg 


cgatggggag 


tggatggtgc 


ctgtgggtgc 


ctgcacctgt 


840 


gccaccggcc 


atgagccagc 


tgccaaggag 


tcccagtgcc 


gcccctgtcc 


ccctgggagc 


900 


tacaaggcga 


agcagggaga 


ggggccctgc 


ctcccatgtc 


cccccaacag 


ccgtaccacc 


960 


tccccagccg 


ccagcatctg 


cacctgccac 


aataacttct 


accgtgcaga 


ctcggactct 


1020 


gcggacagtg 


cctgtaccac 


cgtgccatct 


ccaccccgag 


gtgtgatctc 


caatgtgaat 


1080 


gaaacctcac 


tgatcctcga 


gtggagtgag 


ccccgggacc 


tgggtggccg 


ggatgacctc 


1140 


ctgtacaatg 


tcatctgcaa 


gaagtgccat 


ggggctggag 


gggcctcagc 


ctgctcacgc 


1200 


tgtgafcgaca 


acgtggagtt 


tgtgcctcgg 


cagctgggcc 


tgacggagcg 


ccgggtccac 


1260 


atcagccatc 


tgctggccca 


cacgcgctac 


acctttgagg 


tgcaggcggt 


caacggtgtc 


1320 


tcgggcaaga 


gccctctgcc 


gcctcgttat 


gcggccgtga 


atatcaccac 


aaaccaggct 


1380 


gccccgtctg 


aagtgcccac 


actacgcctg 


cacagcagct 


caggcagcag 


cctcacccta 


1440 


tcctgggcac 


ccccagagcg 


gcccaacgga 


gtcatcctgg 


actacgagat 


gaagtacttt 


1500 


gagaagagcg 


agggcatcgc 


ctccacagtg 


accagccaga 


tgaactccgt 


gcagctggac 


1560 


gggcttcggc 


ctgacgcccg 


ctatgtggtc 


caggtccgtg 


cccgcacagt 


agctggctat 


1620 


gggcagtaca 


gccgccctgc 


cgagtttgag 


accacaagtg 


agagaggctc 


tggggcccag 


1680 


cagctccagg 


agcagcttcc 


cctcatcgtg 


ggctccgcta 


cagctgggct 


tgtcttcgtg 


1740 


gtggctgtcg 


tggtcatcgc 


tatcgtctgc 


ctcaggaagc 


agcgacacgg 


ctctgattcg 


1800 
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gagtacacgg 


agaagctgca 


gcagtacatt 


gctcctggaa 


tgaaggttta 


tattgaccct 


1860 


tttacctacg 


aggaccctaa 


tgaggctgtt 


cgggagtttg 


ccaaggagat 


cgacgtgtcc 


1920 


tgcgtcaaga 


tcgaggaggt 


gatcggagct 


ggggaatttg 


gggaagtgtg 


ccgtggtcga 


1980 


ctgaaacagc 


ctggccgccg 


agaggtgttt 


gtggccatca 


agacgctgaa 


ggtgggctac 


2040 


accgagaggc 


agcggcggga 


cttcctaagc 


gaggcctcca 


tcatgggtca 


gtttgatcac 


2100 


cccaatataa 


tccggctcga 


gggcgtggtc 


accaaaagtc 


ggccagttat 


gatcctcact 


2160 


gagttcatgg 


aaaactgcgc 


cctggactcc 


ttcctccggc 


tcaacgatgg 


gcagttcacg 


2220 


gtcatccagc 


tggtgggcat 


gttgcggggc 


attgctgccg 


gcatgaagta 


cctgtccgag 


2280 


atgaactatg 


tgcaccgcga 


cctggctgct 


cgcaacatcc 


ttgtcaacag 


caacctggtc 


2340 


tgcaaagtct 


cagactttgg 


cctctcccgc 


ttcctggagg 


atgacccctc 


cgatcctacc 


2400 


tacaccagtt 


ccctgggcgg 


gaagatcccc 


atccgctgga 


ctgccccaga 


ggccatagcc 


2460 


tatcggaagt 


tcacttctgc 


tagtgatgtc 


tggagctacg 


gaattgtcat 


gtgggaggtc 


2520 


atgagctatg 


gagagcgacc 


ctactgggac 


atgagcaacc 


aggatgtcat 


caatgccgtg 


2580 


gagcaggatt 


accggctgcc 


accacccatg 


gactgtccca 


cagcactgca 


ccagctcatg 


2640 


ctggactgct 


gggtgcggga 


ccggaacctc 


aggcccaaat 


tctcccagat 


tgtcaatacc 


2700 


ctggacaagc 


tcatccgcaa 


tgctgccagc 


.ctcaaggtca 


ttgccagcgc 


tcagtctggc 


2760 


atgtcacagc 


ccctcctgga 


ccgcacggtc 


ccagattaca 


caaccttcac 


gacagttggt 


2820 


gattggctgg 


atgccatcaa 


9atggggcgg 


tacaaggaga 


gcttcgtcag 


tgcggggttt 


2880 


gcatcttttg 


acctggtggc 


ccagatgacg 


gcagaagacc 


tgctccgtat 


tggggtcacc 


2940 


ctggccggcc 


accagaagaa 


gatcctgagc 


agtatccagg 


acatgcggct 


gcagatgaac 


3000 


cagacgctgc 


ctgtgcaggt 


ctgacaccgg 


ctcccacggg 


gaccctgagg 


accgfcgcagg 


3060 


gatgccaagc 


agccggctgg 


actttcggac 


tcttggactt 


ttggatgcct 


ggccttaggc 


3120 


tgtggcccag 


aagctggaag 


tttgggaaag 


gcccaagctg 


ggacttctcc 


aggcctgtgt 


3180 


tccctcccca 


ggaagtgcgc 


cccaaacctc 


ttcatattga 


agatggatta 


ggagaggggg 


3240 


tgatgacccc 


tccccaagcc 


cctcagggcc 


cagaccttcc 


tgctctccag 


caggggatcc 


3300 


ccacaacctc 


acacttgtct 


gttcttcagt gctggaggtc 


ctggcagggt 


caggctgggg 


3360 


taagccgggg 


ttccacaggg 


cccagccctg gcaggggtct 


ggccccccag 


gtaggcggag 


3420 


agcagtccct 


ccctcaggaa 


ctggaggagg ggactccagg 


aatggggaaa 


tgtgacacca 


3480 


ccatcctgaa 


gccagcttgc 


acctccagtt tgcacaggga 


tttgtcctgg 


gggctgaggg 


3540 


ccctgtcccc 


acccccgccc 


ttggtgctgt cataaaaggg 


caggcagggg 


caggctgagg 


3600 


agttgccctt 


tgccccccag 


agactgactc tcagagccag 


agatgggatg 


tgtgagtgtg 


3660 
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tgtgtgtgtg tgtgcgcgcg cgcgcgcgtg tgtgtgtgca cgcactggcc tgcacagaga 3720 

gcatgggtga gcgtgtaaaa gcttggccct gtgccctaca atggggccag ctgggccgac 3780 

agcagaataa aggcaataag atgaa 3 805 

<210> 55 

<211> 1242 

<212> DNA 

<213> Homo Sapiens 

<400> 55 

atgggcggga ctacgctggc ctggagcatg gcacgtgatt ccgccggcct ggttgccggg 60 

aatctggacc tgagcgagaa gcacgatccc cggccgcccc cgctcttgca tccccctggt 120 

cctactgctg tgcttgctgg cgacggttcg ttccggaagt gtgcagagaa gtctacattc 180 

ccatgtcaag ctacagctag agagttgact cctctatttg agccatgcca gccgccacac 240 

ctggtgggga gagttaaagg ccgagaagtg aacacagctc caaccccact gccttgtaga 300 

ccttccggca gacctgtggc aggtggtgga ggtgatgggc caggggggcc ggagccgggc 360 

tgggttgatc ctcggacctg gctaagcttc caaggccctc ctggagggcc aggaatcggg 420 

ccgggggttg ggccaggctc tgaggtgtgg gggattcccc catgcccccc gccgtatgag 480 

ttctgtgggg ggatggcgta ctgtgggccc caggttggag tggggctagt gccccaaggc 540 

ggcttggaga cctctcagcc tgagggtgaa gcaggagtcg gggtggagag caactccgat 600 

ggggcctccc cggagccctg caccgtcacc cctggtgccg tgaagctgga gaaggagaag 660 

ctggagcaaa acccggagga ggcaaggaag gtattcagcc aaacgaccat ctgccgcttt 720 

gaggctctgc agcttagctt caagaacatg tgtaagctgc ggcccttgct gcagaagtgg 780 

gtggaggaag ctgacaacaa tgaaaatctt caggagatat gcaaagcaga aaccctcgtg 840 

caggcccgaa agagaaagcg aaccagtatc gagaaccgag tgagaggcaa cctggagaat 900 

ttgttcctgc agtgcccgaa acccacactg cagcagatca gccacatcgc ccagcagctt 960 

gggctcgaga aggatgtggt ccgagtgtgg ttctgtaacc ggcgccagaa gggcaagcga 1020 

tcaagcagcg actatgcaca acgagaggat tttgaggctg ctgggtctcc tttctcaggg 10B0 

ggaccagtgt cctttcctct ggccccaggg ccccattttg gtaccccagg ctatgggagc 1140 

cctcacttca ctgcactgta ctcctcggtc cctttccctg agggggaagc ctttccccct 1200 

gtctctgtca ccactctggg ctctcccatg cattcaaact ga 1242 

<210> 56 

<2ll> 1380 

<212> DNA 

<213> Homo Sapiens 
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<400> 56 

ctcatttcac caggcccccg gcttggggcg ccttccttcc ccatggcggg acacctggct 60 

tcggatttcg ccttctcgcc ccctccaggt ggtggaggtg atgggccagg ggggccggag 120 

ccgggctggg ttgatcctcg gacctggcta agcttccaag gccctcctgg agggccagga 180 

atcgggccgg gggttgggcc aggctctgag gtgtggggga ttcccccatg ccccccgccg 240 

tatgagttct gtggggggat ggcgtactgt gggccccagg ttggagtggg gctagtgccc 300 

caaggcggct tggagacctc tcagcctgag ggcgaagcag gagtcggggt ggagagcaac 360 

tccgatgggg cctccccgga gccctgcacc gtcacccctg gtgccgtgaa gctggagaag 420 

gagaagctgg agcaaaaccc ggaggagtcc caggacatca aagctctgca gaaagaactc 480 

gagcaatttg ccaagctcct gaagcagaag aggatcaccc tgggatatac acaggccgat 540 

gtggggctca ccctgggggt tctatttggg aaggtattca gccaaacgac catctgccgc 600 

tttgaggctc tgcagcttag cttcaagaac atgtgtaagc tgcggccctt gctgcagaag 660 

tgggtggagg aagctgacaa caatgaaaat cttcaggaga tatgcaaagc agaaaccctc 720 

gtgcaggccc gaaagagaaa gcgaaccagt atcgagaacc gagtgagagg caacctggag 780 

aatttgttcc tgcagtgccc gaaacccaca ctgcagcaga toagccacat cgcccagcag 840 

cttgggctcg agaaggatgt ggtccgagtg tggttctgta accggcgcca gaagggcaag 900 

cgatcaagca gcgactatgc acaacgagag gattttgagg ctgctgggtc tcctttctca 960 

gggggaccag tgtcctttcc tctggcccca gggccccatt ttggtacccc aggctatggg 1020 

agccctcact tcactgcact gtactcctcg gtccctttcc ctgaggggga agcctttccc 1080 

cctgtctccg tcaccactct gggctctccc atgcattcaa actgaggtgc ctgcccttct 1140 

aggaatgggg gacaggggga ggggaggagc tagggaaaga aaacctggag tttgtgccag 1200 

ggtttttggg attaagttct tcattcacta aggaaggaat tgggaacaca aagggtgggg 1260 

gcaggggagt ttggggcaac tggttggagg gaaggtgaag ttcaatgatg ctcttgattt 1320 

taatcccaca tcatgtatca cttttttctt aaataaagaa gcctgggaca cagtagatag 1380 

<210> 57 

<211> 1855 

<212> DNA 

<213> Homo Sapiens 

<400> 57 

tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct 60 

tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt 120 

ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac 180 
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tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa 240 

tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga 300 

gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca 360 

tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat 420 

ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct 480 

gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc 540 

ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg 600 

ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct 660 

atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt 720 

atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat 780 

ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt 840 

ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg 900 

tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca 960 

tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc 1020 

tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt 1080 

ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg 1140 

gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt 1200 

ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg 1260 

acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc 1320 

cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc 1380 

ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc 1440 

ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt 1500 

tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt 1560 

tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg 1620 

aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat 1680 

gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat 1740 

ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa 1800 

tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt 1855 

<210> 58 

<211> 8619 

<212> PNA 

<213> Homo Sapiens 
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<400> 58 



atgcttcagt gtacaccagc 


caatatggta gaagttcaca 


aagacaaaga 


gtcaagcaaa 


60 


ggtcacacta gacacaaagt 


ggaagaagct cttattaatg 


aagaagcaat 


tttgaacctt 


120 


atggaaaata gtcagacttt 


tcagcctttg acccaaagac 


tgagtgagtc 


acctgttttc 


180 


atggacagta gtcctgatga 


ggctctggta catcttcttg 


ctggtttgga 


aagtgatgga 


240 


tatcgggggg aaagaaatag 


gatgccatca ccatgtcgct 


cctttggaaa 


taataaatat 


300 


ccacaaaata gtgatgatga 


agaaaatgaa ccacagattg 


aaaaagagga 


aatggagctt 


360 


agtttggtga tgtcccagag 


atgggacagc aatattgaag 


aacattgtgc 


caaaaagaga 


420 


tcactgtgca gaaataccca 


cagaagttca actgaagatg 


atgactcatc 


ttcaggagaa 


480 


gaaatggaat ggagtgataa 


cagtttgctt ctagccagtc 


tttctatacc 


tcagttagat 


540 


ggaactgcag atgaaaatag 


tgacaatcca ttgaacaatg 


aaaattctag 


aacccactct 


600 


tctgtaattg caacaagcaa 


gctttcagtt aaaccctcca 


tctttcacaa 


agatgctgct 


660 


acattagaac cctcatcttc 


tgctaagatt acctttcagt 


gtaaacacac 


aagtgccctt 


720 


tcttcccatg ttttgaacaa 


ggaagattta attgaagacc 


tttcacagac 


aaacaaaaat 


780 


acagaaaaag gtctagataa 


ctcagtcact tcttttacaa 


acgaaagcac 


ttattctatg 


840 


aaataccctg gatctttaag 


cagtactgtt cattcagaaa 


attctcataa 


agagaatagt 


900 


aagaaagaga tcctcccagt 


atcttcctgt gaaagtagta 


tttttgatta 


tgaagaagat 


960 


attccatctg ttacaagaca 


agtaccaagt agaaaatata 


caaacattag 


aaaaatcgaa 


1020 


aaggattccc cttttataca 


tatgcaccgt caccctaacg 


agaatacatt 


gggcaaaaat 


1080 


tctttcaact tttctgactt 


aaatcattca aaaaataaag 


tatcctctga 


aggaaatgaa 


1140 


aaaggaaaca gcacagctct 


gagtagttta ttcccttcat 


catttactga 


aaattgtgaa 


1200 


ttactgtcat gctcagggga 


gaatagaact atggtgcatt 


ctcttaatag 


cactgctgat 


1260 


gaaagtggac taaataaact 


taaaattagg tatgaagaat 


ttcaagaaca 


taaaacagaa 


1320 


aagccaagcc tcagccagca 


agcagcacac tatatgtttt 


ttcccagtgt 


tgttctttct 


1380 


aactgtctta ctagaccaca 


gaaactatct cctgtcacat 


ataaattaca 


acctggcaat 


1440 


aaaccatccc ggttaaaatt 


gaataaaagg aaacttgcag 


gtcatcagga 


gacttctacc 


1500 


aaaagtagtg agactggatc 


cacaaaagat aattttatac 


aaaataatcc 


ttgtaatagt 


1560 


aatcctgaga aggataatgc 


attggctagt gatttaacta 


aaaccactcg 


tggagctttt 


1620 


gaaaataaaa cacccacaga 


tggttttata gactgtcact 


ttggagatgg 


aacgttagaa 


1680 


actgagcagt cctttggact 


atatggaaat aaatacacac 


ttagagccaa 


acgcaaggta 


1740 


aattatgaga ctgaagacag 


tgagtcaagt tttgtaactc 


acaactcaaa 


aattagtcta 


1800 
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cctcatccca 


tggaaattgg 


tgaaagttta 


gatggaactc 


tcaaatcccg 


aaaacgaaga 


1860 


aaaatgtcta 


aaaagctgcc 


ccctgtcatc 


ataaagtata 


ttattattaa 


tagatttaga 


1920 


gggagaaaaa 


atatgcttgt 


gaagctagga 


aaaatagact 


ctaaagaaaa 


acaagtaata 


1980 


ttaacagaag 


aaaaaatgga 


actatataaa 


aagcttgcac 


ctttgaagga 


cttttggcca 


2040 


aaagttcccg 


actcccctgc 


aaccaaatat 


cccatttatc 


cactaacacc 


aaagaaaagt 


2100 


cacagaagaa 


agtcaaaaca 


taaatctgct 


aagaaaaaaa 


ctggtaaaca 


acaaaggaca 


2160 


aataatgaaa 


atattaaaag 


aactttgtct 


ttcaggaaaa 


aacggtcaca 


tgctattctt 


2220 


tctcctccct 


caccatctta 


caatgctgaa 


accgaagatt 


gtgacctgaa 


ttatagtgat 


2280 


gttatgtcta 


aactaggttt 


tctttctgag 


agaagcacaa 


gtcccataaa 


ttcttctcca 


2340 


cctcgctgct 


ggtctcccac 


agatccaaga 


gctgaagaaa 


tcatggctgc 


tgcagaaaaa 


2400 


gaggcaatgc 


tttttaaggg 


tcctaatgta 


tataagaaga 


ctgttaattc 


tcgtatagga 


2460 


aaaactagtc 


gcgcaagagc 


acagattaag 


aaatcaaaag 


caaagcttgc 


taatccctct 


2520 


atagttacta 


agaaaaggaa 


caaacgaaat 


cagacaaata 


aactagtaga 


tgatggaaaa 


2580 


aagaaaccaa 


gagcaaaaca 


aaaaacaaat 


gagaaaggta 


catcgagaaa 


gcatacaaca 


2640 


cttaaggatg 


aaaaaataaa 


atctcagtct 


ggtgctgagg 


ttaagtttgt 


actgaaacac 


2700 


cagaatgtgt 


ctgaatttgc 


aagtagttct 


ggaggctctc 


aactactttt 


taaacagaaa 


2760 


gatatgccac 


taatgggctc 


tgctgtagat 


catccccttt 


ctgcttccct 


acccactgga 


2820 


attaatgcac 


aacagaagtt 


atctggctgc 


ttttcttctt 


tcttagaaag 


caagaagtct 


2880 


gtagatttgc 


agacattccc 


cagttcacga 


gatgatttgc 


atccatcagt 


tgtttgtaat 


2940 


tctataggac 


ctggagtctc 


aaaaattaat 


gttcaaaggc 


ctcataatca 


aagtgctatg 


3000 


tttactctaa 


aggaatcaac 


gttaattcaa 


aaaaatatat 


ttgacctttc 


caatcattta 


3060 


tctcaggtag 


cacagaatac 


acagatatct 


tctggtatgt 


cctcaaagat 


agaagataat 


3120 


gcaaataata 


tacaaagaaa 


ctatttgtca 


tcaatcggaa 


agttaagtga 


atatcgcaat 


3180 


tccctagaat 


caaagctgga 


ccaagcatat 


acccctaatt 


ttttgcattg 


caaagacagt 


3240 


cagcagcaga 


ttgtgtgcat 


agcggaacag 


tcaaagcaca 


gtgaaacttg 


ttctccggga 


3300 


aauacagcuc 


cagaggaaag 


ccaaaugccc 


aacaancgcc 


fcfcgfcaacfctc 


ctfcgagaagt 


3 3 60- 


ccaatcaaac 


aaatagcatg 


ggagcaaaag 


caaaggggct 


ttattttaga 


tatgtcaaat 


3420 


tttaaacctg 


aaagagtaaa 


accgaggtcg 


ttatcagaag 


caatttcaca 


aaccaaagca 


3480 


ctttctcagt 


gtaaaaatcg 


aaatgtgtca 


acaccttcag 


catttggtga 


aggacagtct 


3540 


ggactggcag 


ttctaaaaga 


attgttacaa 


aaaagacagc 


agaaagcaca 


aaatgcaaat 


3600 
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actacacaag acccattatc 


caataaacat caaccaaata 


aaaatatttc 


tggttccctt 


3660 


gagcataaca aagcaaataa 


acggacacga tcggtaacgt 


ccccaagaaa 


acctcgaact 


3720 


cccagaagta caaaacaaaa 


agaaaaaatc cccaaacttc 


tcaaagtaga 


ctctttaaat 


3780 


ttacaaaact ctagccagtt 


ggataactct gtatcagatg 


atagtcccat 


ctttttttca 


3840 


gatccaggct ttgaaagttg 


ttactcactt gaagatagtt 


tatctcctga 


acataattat 


3500 


aattttgata ttaacacaat 


aggtcagact ggattttgta 


gcttttattc 


tggaagtcag 


3960 


tttgtcccag ctgatcagaa 


tttgcctcag aagttcctaa 


gtgatgctgt 


tcaggatctt 


4020 


tttccaggac aagctataga 


aaaaaatgag tttttaagtc 


atgacaacca 


gaaatgtgat 


4080 


gaagacaagc atcataccac 


agactcagcc tcatggatta 


gatctggtac 


tttaagtcct 


4140 


gaaatttttg agaagtcaac 


catagatagc aatgagaatc 


gtcgccacaa 


ccagtggaaa 


4200 


aatagctttc atcctctaac 


aactcggtct aactcaataa 


tggattcttt 


ctgtgttcag 


4260 


caggcagaag actgtctaag 


tgaaaaatct agattgaata 


ggagttcagt 


aagcaaagaa 


4320 


gtgtttctta gcctcccaca 


gccaaacaat tcagactgga 


ttcaaggtca 


caccagaaaa 


4380 


gaaatgggac agtctcttga 


ctcagccaat acctctttta 


ctgcaatact 


ctcctcccct 


4440 


gatggtgaac ttgtagacgt 


ggcctgtgaa gatttagaac 


tgtatgtttc 


aagaaacaat 


4500 


gatatgttga caccaactcc 


tgatagttca ccaagatcta 


ctagctctcc 


ttcacaatct 


4560 


aaaaatggca gcttcacccc 


tcgaactgct aacattctga 


aaccacttat 


gtccccccca 


4620 


agtagggaag aaattatggc 


aactttgttg gatcatgacc 


tgtctgagac 


tatttaccag 


4680 


gaaccatttt gcagtaatcc 


ttctgatgta ccagaaaagc 


ccagggagat 


tggtggacgg 


4740 


ctcctcatgg tagaaactcg 


acttgcaaat gatctggctg 


agtttgaggg 


agacfctttcc 


4800 


ttggaaggac ttcgtctttg 


gaaaacagca ttctcagcaa 


tgactcagaa 


tccaaggcca 


4860 


gggtcacccc ttcgcagtgg 


ccaaggagtt gtcaataaag 


ggtcaagtaa 


tagccctaag 


4920 


atggttgaag ataaaaaaat 


tgtgattatg ccttgcaaat 


gtgccccaag 


tcgacaactg 


4980 


gttcaagtgt ggcttcaagc 


caaagaagaa tacgaacgtt 


ccaagaaact 


gcctaaaacc 


5040 


aagccaactg gagttgtaaa 


atctgctgag aactttagct 


cttcagttaa 


cccagatgac 


5100 


aaacctgtag tgcctccaaa 


aatggatgta agtccatgta 


tactccccac 


tacagcacat 


5160 


accaaggagg atgttgataa 


ttctcagatt gctttacaag 


caccaaccac 


gggatgtagt 


5220 


caaactgcaa gtgaaagtca 


gatgctgcca ccagttgcct 


ctgcaagtga 


tcccgaaaaa 


5280 


gatgaagatg atgatgataa 


ctattacatt agttatagct 


cccctgattc 


tccagtaatt 


5340 


cccccttggc aacaaccaat 


atccccagat tccaaagcat 


taaatggaga 


tgatagaccc 


5400 


tcatcaccag tagaggagct 


gccttcattg gcttttgaga 


acttcttaaa 


gccaataaaa 


5460 
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gatggtatac aaaaaagccc ctgcagtgag cctcaagagc ctctagtgat atctccaatt 5520 

aatactaggg caagaactgg gaaatgtgaa tcactttgct ttcatagtac accaatcata 5580 

cagagaaaac ttctggaaag gcttcctgaa gcacctggcc ttagcccatt atcaacagaa 5640 

ccaaaaacac agaagttgag taataagaaa ggaagtaata ctgacactct tagaagagta 5700 

ctgttaacac aagcaaagaa tcaatttgca gcagtaaata ccccacagaa agaaacttct 5760 

cagattgatg gaccatcttt aaacaatact tacggtttca aagtcagcat acaaaactta 5820 

caggaggcaa aagctttaca tgagatacaa aatcttaccc taatcagtgt ggagttgcat 5880 

gctcgaacta gacgagactt agaaccggat cctgaatttg acccaatctg tgctctgttc 5940 

tactgcatct catctgacac tccactgcca gatacagaaa aaacagaact cacaggtgta 6000 

atagtgattg ataaagacaa gacagttttc agtcaagata tcagatatca gactccatta 6060 

cttattagat ctggaattac aggactcgaa gtcacctatg ctgctgatga gaaggcactt 6120 

tttcatgaaa ttgcaaatat aataaagagg tatgatcctg atattctgct aggatatgag 6180 

attcagatgc attcctgggg ttacctctta caaagggctg ccgctttaag tattgactta 6240 

tgtcggatga tctctcgggt gccagatgac aaaattgaga acagatttgc agctgaaaga 6300 

gatgagtatg gatcatatac aatgagtgag ataaatattg ttggccgaat tacactaaat 6360 

ctttggagaa tcatgagaaa tgaggtggct ctaactaact acacctttga aaatgtgagc 642 0 

tttcatgttc ttcatcagcg ttttcccctc tttacctttc gagtcttgtc agactggttt 6480 

gataacaaga cagatctata caggtactgt tctataactc tgaagaagag gcaacagacc 6540 

tctgctttgt accactggca ggtcctgggc ccaatatact tctgggtcat ttttacatct 6600 

tataatatta aaattctttt tatggatttg ctgagggttt tattgtttgt tttcttaaga 6660 

agatggaaaa tggttgatca ttatgttagc cgtgtccgtg gaaatctcca aatgttagaa 6720 

cagctggacc tgattgggaa aaccagtgag atggctagac tttttggcat tcagttttta 6780 

catgtactga caaggggttc acagtaccgt gtggaatcaa tgatgttgcg tattgctaaa 6840 

ccaatgaact atattcctgt gacacctagt gttcagcaaa gatcccagat gagagcccca 6900 

cagtgtgttc ctctaattat ggagcctgaa tcccgcttct atagcaactc tgttctcgtt 6960 

ttggatttcc aatcacttta tccttctatt gtgattgcat ataactactg cttttccacc 7020 

tgccttggcc atgtggagaa cttgggaaag tatgatgagt tcaaatttgg ctgtacctct 7080 

ctgagagtac ctccagattt actttaccaa gttaggcatg atatcacagt gtcccccaat 7140 

ggagtagctt ttgtcaagcc ttcagtaaga aaaggtgtac taccaagaat gcttgaagaa 7200 

attttgaaga ctagatttat ggtgaagcag tcaatgaagg cttacaagca agacagagcc 7260 
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ctgtcacgaa tgcttgatgc gcgtcagttg ggacttaagc tgatagcaaa tgtcacattt 7320 

ggctatacat ctgctaattt ttctgggaga atgccatgca ttgaggttgg cgatagtatt 7380 

gttcacaaag ccagagagac cttggaacga gctattaaac tggtgaatga taccaagaaa 744 0 

tggggggcta gggttgtata tggcgatact gacagtatgt ttgtgctact gaaaggagcc 7500 

actaaggagc agtcttttaa gattggtcag gaaattgccg aagctgtaac tgctaccaat 7560 

cctaaaccag tgaaattgaa gtttgaaaag gtatatttgc cctgtgtttt acaaacaaaa 7620 

aagaggtatg tgggttacat gtatgaaaca ctggatcaga aggacccagt atttgatgca 7680 

aaaggaatag aaacagtcag aagagattcc tgccctgctg tttctaagat acttgagcgt 7740 

tctctaaagc tgctatttga aacgagagat ataagtctaa ttaaacagta tgttcagcga 7800 

caatgtatga agcttctgga aggaaaggcc agcatacaag actttatctt tgccaaggaa 7 860 

tacagaggaa gtttttctta taaaccagga gcttgtgtgc cagcccttga acttacaagt 7 920 

tttttcattg ttttattatt gtttaattct gacttaattt gtgagaaaga tggcttccat 7980 

aacagtattt gggtgtggtt tttttctttg aattcgaata ggaaaatgct gacttatgac 8040 

cggcgctctg agcctcaggt tggggagcga gtgccatacg tcatcattta tgggaccccc 8100 

ggagtaccac ttatccagct tgtaaggcgc ccagtggaag tcctgcagga cccaactctg 8160 

agactgaatg ctacttacta tattaccaag caaatccttc cacccttggc aagaatcttc 8220 

tcacttattg gtattgatgt cttcagctgg tatcatgaat taccaaggat ccataaagct 8280 

accagctcct cgcgaagtga acctgaaggg cggaaaggca ctatttcaca atattttact 834 0 

accttacact gtcctgtgtg tgatgaccta actcagcatg gcatctgtag taaatgtcgg 8400 

agccaacctc agcatgttgc agtcatcctc aaccaagaaa tccgggagtt ggaacgtcaa 8460 

caggagcaac ttgtaaagat atgcaagaac tgtacaggtt gctttgatcg acacatccca 8520 

tgtgtttctc tgaactgccc agtacttttc aaactctccc gagtaaatag agaattgtcc 8580 

aaggcaccat atctccggca gttattagac cagttttaa 8619 

<210> 59 

<211> 2335 

<212> PRT 

<213> Homo Sapiens 

<400> 59 

Met Ala Met Met lie Leu Arg Val Asp Tyr Thr Phe Glu Glu Asn Arg 



Asp Lys Leu Ala Ser Arg Lys Lys Glu Tyr Ser Gin Gly Ser Val Ala 
20 25 30 
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Asp Leu Thr Pro Asp Asn Trp Lys Asn lie Thr Val Pro His Ser Gly 
35 40 45 



Arg His Ser Glu Val Ser Arg Gly Glu Leu Val Cys Arg Thr Cys Ser 
50 55 60 



Glu Cys Ser Ala Gly Pro His lie Trp Met Lys Gly Leu Tyr Gin Thr 
65 70 75 80 



Gin Asp Glu Glu Ala Gly Gly Glu Asn lie Phe lie Leu Leu Phe lie 
85 90 95 



Glu Ser Thr Gin Phe Gly Gin Phe Val Ala Met Gly Ser Pro lie Thr 
100 105 110 



Glu His Lys Val Phe Thr Met Tyr Leu Gly Leu Ala Thr His Leu Phe 
115 120 125 



Tyr Ser Leu lie Thr His Pro Phe Val Leu Leu Glu Asn His Ser Cys 
130 135 140 



Pro Ser Ser Val His Gly Phe Asp Val Ala Gly Leu lie Phe Asp Lys 
145 150 155 160 



Val Gly Met Arg Ser Arg Pro Gly Arg Met Gly Ala Leu Phe Ala Tyr 
165 170 175 



Phe Ala Gly Phe lie Arg Arg Lys Ala Leu Val Val Cys Leu Phe Val 
180 185 190 



Phe Cys Trp Ser Asn Glu Ala Ala Asn Lys Pro Pro lie Gin Glu Ala 
195 200 205 



Ala Gin Leu Ser Arg Pro Ala Gin Gly Ala Arg Arg Ala Ser Glu Arg 
210 215 220 



Lys Phe Leu Ala Phe Ser Cys Pro Leu Ala Gly His Tyr Ala Ala Lys 
225 230 235 240 



Gin Pro Ser Pro Ser Pro Pro Pro Pro Pro Ala Pro Pro Ala Pro Pro 
245 250 255 



Ala Ala Arg Ala Ala Gin Leu Ser Ala Gly Gly Gly Val Ala Gin Pro 
260 265 270 
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Ser Ala Asp Gly Thr Leu Ala Ala Arg Pro Gin Arg Leu Leu Lys Ser 
275 280 " 285 



Lys Val Gly Gly Gly Arg Arg Ala Pro Arg Ala Leu His Gly Arg Cys 
290 295 " 300 



Leu Ala Ser Pro Pro Gin Pro Arg Arg Ala Gly Gly Arg Gly Val Gly 
305 210 315 320 



Ala Ala Glu Gly Gly Val Gly Ser Thr Met Gin Phe Val Ser Trp Ala 
325 330 335 



Thr Leu Leu Thr Leu Leu Val Arg Asp Leu Ala Glu Met Gly Ser Pro 
340 345 350 



Asp Ala Ala Ala Ala Val Arg Lys Asp Arg Leu His Pro Arg Gin Val 
355 360 365 



Lys Leu Leu Glu Thr Leu Ser Glu Tyr Glu lie Val Ser Pro He Arg 
370 375 380 



Val Asn Ala Leu Gly Glu Pro Phe Pro Thr Asn Val His Phe Lys Arg 
385 390 395 " 400 



Thr Arg Arg Ser He Asn Ser Ala Thr Asp Pro Trp Pro Ala Phe Ala 
405 410 415 



Ser Ser Ser Ser Ser Ser Thr Ser Ser Gin Ala His Tyr Arg Leu Ser 
420 425 430 



Ala Phe Gly Gin Gin Phe Leu Phe Asn Leu Thr Ala Asn Ala Gly Phe 
435 440 445 



He Ala Pro Leu Phe Thr Val Thr Leu Leu Gly Thr Pro Gly Val Asn 
450 455 460 



Gin Thr Lys Phe Tyr Ser Glu Glu Glu Ala Glu Leu Lys His Cys Phe 
465 470 475 480 



Tyr Lys Gly Tyr Val Asn Thr Asn Ser Glu His Thr Ala Val He Ser 
485 490 495 



Leu Cys Ser Gly Met Gly Leu Leu Asp Val Ser Glu Leu Ser Gly Val 
500 505 510 



Trp Thr Arg Phe Ser Gly Ala Leu Pro Asn Ala Ala Arg Arg Pro Gly 
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Ser Gin Phe Pro Asn Ser Glu Lys Val Thr Gly Val Ala Val Pro Cys 
530 535 540 



Ser Lys Leu Gly His Pro Gly Ala Glu Pro Leu Ser Ala Gly Arg Thr 
545 550 555 560 



Arg Leu Leu He Val Asp Leu Thr Arg His Leu Pro Pro Thr Ser Pro 
565 570 575 



Arg His Leu Arg Ser Arg Cys Gly Thr Val Leu Ala Arg Ala Arg Val 
580 585 590 



Val Leu Asp Phe Pro Lys Arg Arg Ala Phe Leu Pro Arg Ala Cys Asp 
595 600 605 



Ala Glu Thr Phe Pro Ala Gly Pro Trp lie Leu Thr Pro Arg His Trp 
610 615 620 



Ala Ala Pro Ser Val Arg Cys Arg Ser Trp Val Leu Lys Phe Pro Ser 
625 630 635 640 



Thr Ser Phe Leu Leu Cys Leu Ser Met Glu Gly Ser Gly Gly Glu Arg 
645 650 655 



Gly Lys Pro Glu Asp Trp Glu Gly Val Val Leu Ala Cys Trp Asp Ser 
660 665 670 



Arg Lys Gly He Asn Pro Phe Ser Pro Gin Gin Ser Ala Arg Ser Arg 
675 680 685 



Gly Ser Arg Asn Ala Leu Ser Arg Leu Phe Gly Gly Gly Arg Arg Arg 
690 695 700 



Gin Leu Gly Glu Val Gly Gly Gly Ala Ala Leu Gly Thr Phe Arg Ser 
705 710 715 720 



His Asp Gly Asp Tyr Phe He Glu Pro Leu Gin Ser Met Asp Glu Gin 
725 730 735 



Glu Asp Glu Glu Glu Gin Asn Lys Pro His He He Tyr Arg Arg Ser 
740 745 750 



Ala Pro Gin Arg Glu Pro Ser Thr Gly Arg His Ala Cys Asp Thr Ser 
755 760 765 
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Gly Leu Gin Lys Cys Leu lie Asn Gly Ser His Glu Asn lie Tyr Val 
770 775 780 



Phe Val Glu Cys Phe Leu Glu Thr Ser Gly Leu Leu Met Phe Cys Asp 
785 790 795 800 



Leu Arg Asn Cys Ser Lys Val Pro Val Arg Tyr Ala Val Ser Tyr Phe 
805 810 815 



Cys Thr Pro Ser Leu Asn Ser Asp Ala Ala Ser Gin Asn Ser Leu Glu 
820 825 830 



Tyr Gly Thr lie His Gin Gin Val Ser Glu Glu Trp Thr Asn Arg Ser 
835 840 845 



Arg Thr Pro Leu Glu Pro Glu His Lys Asn Arg His Ser Lys Asp Lys 
850 855 860 



Lys Lys Thr Arg Ala Arg Lys Trp Gly Glu Arg lie Asn Leu Ala Gly 
865 870 875 880 



Asp Val Ala Ala Leu Asn Ser Gly Leu Ala Thr Glu Ala Phe Ser Ala 
885 890 895 



Tyr Gly Asn Lys Thr Asp Asn Thr Arg Glu Lys Arg Thr His Arg Arg 
900 905 910 



Thr Lys Arg Phe Leu Ser Tyr Pro Arg Phe Val Glu Val Leu Val Val 
915 920 . 925 



Ala Asp Asn Arg Met Val Ser Tyr His Gly Glu Asn Leu Gin His Tyr 
930 935 940 



lie Leu Thr Leu Met Ser lie Val Ala Ser lie Tyr Lys Asp Pro Ser 
945 950 955 960 



He Gly Asn Leu He Asn He Val He Val Asn Leu lie Val He His 
965 970 975 



Asn Glu Gin Asp Gly Pro Ser He Ser Phe Asn Ala Gin Thr Thr Leu 
980 985 990 



Lys Asn Phe Cys Gin Trp Gin His Ser Lys Ash Ser Pro Gly Gly He 
995 1000 1005 
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His His Asp Thr Ala Val Leu Leu Thr Arg Gin Asp He Cys Arg 
1010 1015 1020 



Ala His Asp Lys Cys Asp Thr Leu Gly Leu Ala Glu Leu Gly Thr 
1025 1030 1035 



He Cys Asp Pro Tyr Arg Ser Cys Ser He Ser Glu Asp Ser Gly 
1040 1045 1050 



Leu Ser Thr Ala Phe Thr He Ala His Glu Leu Gly His Val Phe 
1055 1060 1065 



Asn Met Pro His Asp Asp Asn Asn Lys Cys Lys Glu Glu Gly Val 
1070 1075 1080 



Lys Ser Pro Gin His Val Met Ala Pro Thr Leu Asn Phe Tyr Thr 
1085 1090 1095 



Asn Pro Trp Met Trp Ser Lys Cys Ser Arg Lys Tyr He Thr Glu 
1100 1105 1110 



Phe Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu Asn Glu Pro Glu 
1115 1120 1125 



Ser Arg Pro Tyr Pro Leu Pro Val Gin Leu Pro Gly He Leu Tyr 
1130 1135 1140 



Asn Val Asn Lys Gin Cys Glu Leu He Phe Gly Pro Gly Ser Gin 
1145 1150 1155 



Val Cys Pro Tyr Met His Cys Lys Tyr Gly Phe Cys Val Pro Lys 
1160 1165 1170 



Glu Met Asp Val Pro Val Thr Asp Gly Ser Trp Gly Ser Trp Ser 
1175 1180 1185 



Pro Phe Gly Thr Cys Ser Arg Thr Cys Gly Gly Gly He Lys Thr 
1190 1195 1200 



Ala He Arg Glu Cys Asn Arg Pro Glu Pro Lys Asn Gly Gly Lys 
1205 1210 1215 



Tyr Cys Val Gly Arg Arg Met Lys Phe Lys Ser Cys Asn Thr Glu 
1220 1225 1230 
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Pro Cys Leu Lys Gin Lys Arg Asp Phe Arg Asp Glu Gin Cya Ala 

1235 1240 ~ 1245 



His Phe Asp Gly Lys His Phe Asn He Asn Gly Leu Leu Pro Asn 
1250 1255 1260 



Val Arg Trp Val Pro Lys Tyr Ser Gly He Leu Met Lys Asp Arg 
1265 1270 1275 



Cys Lys Leu Phe Cys Arg Val Ala Gly Asn Thr Ala Tyr Tyr Gin 
1280 1285 1290 



Leu Arg Asp Arg Val He Asp Gly Thr Pro Cys Gly Gin Asp Thr 
1295 1300 1305 



Asn Asp He Cys Val Gin Gly Leu Cys Arg Gin Ala Gly Cys Asp 
1310 1315 1320 



His Val Leu Asn Ser Lys Ala Arg Arg Asp Lys Cys Gly Val Cys 
• 1325 1330 1335 



Gly Gly Asp Asn Ser Ser Cys Lys Thr Val Ala Gly Thr Phe Asn 
1340 1345 1350 



Thr Val His Tyr Gly Tyr Asn Thr Val Val Arg He Pro Ala' Gly 
1355 1360 1365 



Ala Thr Asn lie Asp Val Arg Gin His Ser Phe Ser Gly Glu Thr 
1370 1375 1380 



Asp Asp Asp Asn Tyr Leu Ala Leu Ser Ser Ser Lys Gly Glu Phe 
1385 1390 1395 



Leu Leu Asn Gly Asn Phe Val Val Thr Met Ala Lys Arg Glu He 
1400 1405 1410 



Arg He Gly Asn Ala Val Val Glu Tyr Ser Gly Ser Glu Thr Ala 
1415 1420 1425 



Val Glu Arg lie Asn Ser Thr Asp Arg He Glu Gin Glu Leu Leu 
1430 1435 1440 



Leu Gin val Leu Ser Val Gly Lys Leu Tyr Asn Pro Asp Val Arg 
1445 1450 1455 



Tyr Ser phe Asn He Pro He Glu Asp Lys Pro Gin Gin Phe Tyr 



WO 2004/073657 PCT/US2 004/005455 

110/282 

1460 1465 1470 



Trp Asn Ser His Gly Pro Trp Gin Ala Cys Ser Lys Pro Cys Gin 
1475 ■ 1480 1485 



Gly Glu Arg Lys Arg Lys Leu Val Cys Thr Arg Glu Ser Asp Gin 
1490 1495 1500 



Leu Thr Val Ser Asp Gin Arg Cys Asp Arg Leu Pro Gin Pro Gly 
1505 1510 1515 



His lie Thr Glu Pro Cys Gly Thr Asp Cys Asp Leu Arg Trp His 
1520 1525 1530 



Val Ala Ser Arg Ser Glu Cys Ser Ala Gin Cys Gly Leu Gly Tyr 
1535 1540 1545 



Arg Thr Leu Asp lie Tyr Cys Ala Lys Tyr Ser Arg Leu Asp Gly 
1550 1555 1550 



Lys Thr Glu Lys Val Asp Asp Gly Phe Cys Ser Ser His Pro Lys 
1565 1570 1575 



Pro Ser Asn Arg Glu Lys Cys Ser Gly Glu Cys Asn Thr Gly Gly 
1580 1585 1590 



Trp Arg Tyr Ser Ala Trp Thr Glu Cys Ser Lys Ser Cys Asp Gly 
1595 1600 1605 



Gly Thr Gin Arg Arg Arg Ala He Cys Val Asn Thr Arg Asn 'Asp 
1610 1615 1620 



Val Leu Asp Asp Ser Lys Cys Thr His Gin Glu Lys Val Thr He 
1625 ■ 1630 1635 



Gin Arg Cys Ser Glu Phe Pro Cys Pro Gin Trp Lys Ser Gly Asp 
1640 1645 1650 



Trp Ser Glu Cys Leu Val Thr Cys Gly Lys Gly His Lys His Arg 
1655 1660 1665 



Gin Val Trp Cys Gin Phe Gly Glu Asp Arg Leu Asn Asp Arg Met 
1670 1675 1680 



Cys Asp 
1685 



Pro Glu Thr Lys Pro Thr Ser Met Gin Thr Cys Gin Gin 
1690 1695 



WO 2004/073657 PCT/US2004/005455 

111/282 



Pro Glu Cys Ala Ser Trp Gin Ala Gly Pro Trp Gly Gin Cys Ser 
1700 1705 1710 



Val Thr Cys Gly Gin Gly Tyr Gin Leu Arg Ala Val Lys Cys lie 
1715 1720 1725 



lie Gly Thr Tyr Met Ser Val Val Asp Asp Asn Asp Cys Asn Ala 
1730 1735 1740 



Ala Thr Arg Pro Thr Asp Thr Gin Asp Cys Glu Leu Pro Ser Cys 
1745 1750 1755 



His Pro Pro Pro Ala Ala Pro Glu Thr Arg Arg Ser Thr Tyr Ser 
1760 1765 1770 



Ala Pro Arg Thr Gin Trp Arg Phe Gly Ser Trp Thr Pro Cys Ser 
1775 1780 1785 



Ala Thr Cys Gly Lys Gly Thr Arg Met Arg Tyr Val Ser Cys Arg 
1790 1795 1800 



Asp Glu Asn Gly Ser Val Ala Asp Glu Ser Ala Cys Ala Thr Leu 
1805 1810 1815 



Pro Arg Pro Val Ala Lys Glu Glu Cys Ser Val Thr Pro Cys Gly 
1820 1825 1830 



Gin Trp Lys Ala Leu Asp Trp Ser Ser Cys Ser Val Thr Cys Gly 
1835 1840 1845 



Gin Gly Arg Ala Thr Arg Gin Val Met Cys Val Asn Tyr Ser Asp 
1850 1855 1860 



His Val lie Asp Arg Ser Glu Cys Asp Gin Asp Tyr He Pro Glu 
1865 1870 1875 



Thr Asp Gin Asp Cys Ser Met Ser Pro Cys Pro Gin Arg Thr Pro 
1880 1885 1890 



Asp Ser Gly Leu Ala Gin His Pro Phe Gin Asn Glu Asp Tyr Arg 
1895 1900 1905 



Pro Arg Ser Ala Ser Pro Ser Arg Thr His Val Leu Gly Gly Asn 
1910 1915 1920 
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Gin Trp Arg Thr Gly Pro Trp Gly Ala Thr Tyr Trp Arg Glu Asn 
1925 1930 1935 



Thr Met Glu Phe Leu Glu Leu Phe Leu Pro Glu Ser Leu Thr Gly 
1940 1945 1950 



Pro Gly Ser Lys Ser Cys Asp Gin His Tyr Gly Ser Thr Cys Ala 
1955 1960 1965 



Gly Gly Ser Gin Arg Arg Val Val Val Cys Gin Asp Glu Asn Gly 
1970 1975 1980 



Tyr Thr Ala Asn Asp Cys Val Glu Arg lie Lys Pro Asp Glu Gin 
1985 1990 1995 



Arg Ala Cys Glu Ser Gly Pro Cys Pro Gin Trp Ala Tyr Gly Asn 
2000 2005 2010 



Trp Gly Glu Cys Thr Lys Leu Cys Gly Gly Gly lie Arg Thr Arg 
2015 2020 2025 



Leu Val Val Cys Gin Arg Ser Asn Gly Glu Arg Phe Pro Asp Leu 
2030 2035 2040 



Ser Cys Glu lie Leu Asp Lys Pro Pro Asp Arg Glu Gin Cys Asn 
2045 2050 2055 



Thr His Ala Cys Pro His Asp Ala Ala Trp Ser Thr Gly Pro Trp 
2060 2065 2070 



Ser Ser Ser Met Trp Gin Val Asn Asn Lys Thr Val Thr Leu Gly 
2075 2080 2085 



Asn Leu Cys Ser Val Ser Cys Gly Arg Gly His Lys Gin Arg Asn 
2090 2095 2100 



Val Tyr Cys Met Ala Lys Asp Gly Ser His Leu Glu Ser Asp Tyr 
2105 2110 2115 



Cys Lys His Leu Ala Lys Pro His Gly His Arg Lys Cys Arg Gly 
2120 2125 2130 



Gly Arg Cys Pro Lys Trp Lys Ala Gly Ala Trp Ser Gin Lys Thr 
2135 2140 2145 



WO 2004/073657 



113/282 



PCT/US2004/005455 



Thr Asn Ser Asp Cys Thr Glu Ala Asp Cys Gly His Leu Ala Glu 
2150 2155 2160 



lie Glu Ser Gin Phe He Leu Glu Val Leu Glu Glu Arg Ala Val 
2165 2170 2175 



Asp Glu Ser Ser Arg Lys Tyr Leu Cys Pro Phe Ala Cys Leu Gin 
2180 2185 2190 



Lys Cys Ser Val Ser Cys Gly Arg Gly Val Gin Gin Arg His Val 
2195 2200 2205 



Gly Cys Gin He Gly Thr His Lys He Ala Arg Glu Thr Glu Cys 
2210 2215 2220 



Asn Pro Tyr Thr Arg Pro Glu Ser Glu Arg Asp Cys Gin Gly Pro 
2225 2230 2235 



Arg Cys Pro Leu Tyr Thr Trp Arg Ala Glu Glu Trp Gin Glu Thr 
2240 2245 2250 



Tyr His Gly Leu Leu Ser Pro Ser Pro Ser Leu Cys His Ala Lys 
2255 2260 2265 



Leu Asn Pro Ala Pro Arg Ser Gly Lys Pro Gin Pro Arg Cys His 
2270 2275 2280 



Phe Leu Ser Glu Ala Phe Ala Asn His Thr Thr Pro Leu Asn Leu 
2285 2290 2295 



Ser Gin Met Leu Leu His Ser Ala Leu Thr Thr His Ala Asp Tyr 
2300 2305 2310 



Cys Thr Leu Ala Val Asn Thr Trp Asn Ser His Cys Leu Phe Phe 
2315 2320 2325 



Ser Ser Met Leu Ser Val He 
2330 2335 



<210> 60 

<211> 1072 

<212> PRT 

<213> Homo Sapiens 

<400> 60 

Met Gin Phe Val Ser Trp Ala Thr Leu Leu Thr Leu Leu Val Arg Asp 
15 10 15 
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Leu Ala Glu Met Gly Ser Pro Asp Ala Ala Ala Ala Val Arg Lys Asp 
20 25 30 



Arg Leu His Pro Arg Gin Val Lys Leu Leu Glu Thr Leu Gly Glu Tyr 
35 40 45 



Glu lie Val Ser Pro lie Arg Val Asn Ala Leu Gly Glu Pro Phe Pro 
50 55 60 



Thr Asn Val His Phe Lys Arg Thr Arg Arg Ser lie Asn Ser Ala Thr 
65 70 75 80 



Asp Pro Trp Pro Ala Phe Ala Ser Ser Ser Ser Ser Ser Thr Ser Ser 
85 90 95 



Gin Ala His Tyr Arg Leu Ser Ala Phe Gly Gin Gin Phe Leu Phe Asn 
100 105 110 



Leu Thr Ala Asn Ala Gly Phe lie Ala Pro Leu Phe Thr Val Thr Leu 
115 120 125 



Leu Gly Thr Pro Gly Val Asn Gin Thr Lys Phe Tyr Ser Glu Glu Glu 
130 135 140 



Ala Glu Leu Lys His Cys Phe Tyr Lys Gly Tyr Val Asn Thr Asn Ser 
145 150 155 160 



Glu His Thr Ala Val lie Ser Leu Cys Ser Gly Met Leu Gly Thr Phe 
165 170 175 



Arg Ser His Asp Gly Asp Tyr Phe lie Glu Pro Leu Gin Ser Met Asp 
180 185 190 



Glu Gin Glu Asp Glu Glu Glu Gin Asn Lys Pro His lie He Tyr Arg 
195 200 205 



Arg Ser Ala Pro Gin Arg Glu Pro Ser Thr Gly Arg His Ala Cys Asp 
210 215 220 



Thr Ser Glu His Lys Asn Arg His Ser Lys Asp Lys Lys Lys Thr Arg 
225 230 235 ' 240 



Ala Arg Lys Trp Gly Glu Arg He Asn Leu Ala Gly Asp Val Ala Ala 
245 250 255 
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Leu Asn Ser Gly Leu Ala Thr Glu Ala Phe Ser Ala Tyr Gly Asn Lys 
260 265 270 



Thr Asp Asn Thr Arg Glu Lys Arg Thr His Arg Arg Thr Lys Arg Phe 
275 280 285 



Leu Ser Tyr Pro Arg Phe Val Glu Val Leu Val Val Ala Asp Asn Arg 
290 295 300 



Met Val Ser Tyr His Gly Glu Asn Leu Gin His Tyr He Leu Thr Leu 
305 310 315 320 



Met Ser He Val Ala Ser He Tyr Lys Asp Pro Ser He Gly Asn Leu 
325 330 335 



He Asn He Val He Val Asn Leu lie Val lie His Asn Glu Gin Asp 
340 345 350 



Gly Pro Ser He Ser Phe Asn Ala Gin Thr Thr Leu Lys Asn Leu Cys 
355 360 365 



Gin Trp Gin His Ser Lys Asn Ser Pro Gly Gly He His His Asp Thr 
370 375 380 



Ala Val Leu Leu Thr Arg Gin Asp He Cys Arg Ala His Asp Lys Cys 
385 390 395 400 



Asp Thr Leu Gly Leu Ala Glu Leu Gly Thr He Cys Asp Pro Tyr Arg 
405 410 ' 415 



Ser Cys Ser lie Ser Glu Asp Ser Gly Leu Ser Thr Ala Phe Thr He 
420 425 430 



Ala His Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Asn Asn 
435 440 445 



Lys Cys Lys Glu Glu Gly Val Lys Ser Pro Gin His Val Met Ala Pro 
450 455 460 



Thr Leu Asn Phe Tyr Thr Asn Pro Trp Met Trp Ser Lys Cys Ser Arg 
465 470 475 480 



Lys Tyr He Thr Glu Phe Leu Asp Thr Gly Tyr Gly Glu Cys Leu Leu 
485 490 " 495 
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Asn Glu Pro Glu Ser Arg Pro Tyr Pro Leu Pro Val Gin Leu Pro Gly 
500 505 510 



lie Leu Tyr Asn Val Asn Lys Gin Cys Glu Leu lie Phe Gly Pro Gly 
515 520 525 



Ser Gin Val Cys Pro Tyr Met Met Gin Cys Arg Arg Leu Trp Cys Asn 
530 535 540 



Asn Val Asn Gly Val His Lys Gly Cys Arg Thr Gin His Thr Pro Trp 
545 550 555 560 



Ala Asp Gly Thr Glu Cys Glu Pro Gly Lys His Cys Lys Tyr Gly Phe 
565 570 575 



Cys Val Pro Lys Glu Met Asp Val Pro Val Thr Asp Gly Ser Trp Gly 
580 585 590 



Ser Trp Ser Pro Phe Gly Thr Cys Ser Arg Thr Cys Gly Gly Gly lie 
595 600 605 



Lys Thr Ala Xle Arg Glu Cys Asn Arg Pro Glu Pro Lys Asn Gly Gly 
610 615 620 



Lys Tyr Cys Val Gly Arg Arg Met Lys Phe Lys Ser Cys Asn Thr Glu 
625 630 635 640 



Pro Cys Leu Lys Gin Lys Arg Asp Phe Arg Asp Glu Gin Cys Ala His 
645 650 655 



Phe Asp Gly Lys His Phe Asn lie Asn Gly Leu Leu Pro Asn Val Arg 
660 665 670 



Trp Val Pro Lys Tyr Ser Gly Xle Leu Met Lys Asp Arg Cys Lys Leu 
675 680 685 



Phe Cys Arg Val Ala Gly Asn Thr Ala Tyr Tyr Gin Leu Arg Asp Arg 
690 695 700 



Val lie Asp Gly Thr Pro Cys Gly Gin Asp Thr Asn Asp lie Cys Val 
705 710 715 720 



Gin Gly Leu Cys Arg Gin Ala Gly Cys Asp His Val Leu Asn Ser Lys 
725 730 735 



Ala Arg Arg Asp Lys Cys Gly Val Cys Gly Gly Asp Asn Ser Ser Cys 
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750 



Lys Thr Val Ala Gly Thr Phe Asn Thr Val His Tyr Gly Tyr Asn Thr 
755 760 765 



Val Val Arg lie Pro Ala Gly Ala Thr Asn lie Asp Val Arg Gin His 
770 775 780 



Ser Phe Ser Gly Glu Thr Asp Asp Asp Asn Tyr Leu Ala Leu Ser Ser 
785 790 795 800 



Ser Lys Gly Glu Phe Leu Leu Asn Gly Asn Phe Val Val Thr Met Ala 
805 810 815 



Lys Arg Glu lie Arg lie Gly Asn Ala Val Val Glu Tyr Ser Gly Ser 
820 825 830 



Glu Thr Ala Val Glu Arg lie Asn Ser Thr Asp Arg lie Glu G2n Glu 
835 840 845 



Leu Leu Leu Gin Val Leu Ser Val Gly Lys Leu Tyr Asn Pro Asp Val 
850 855 860 



Arg Tyr Ser Phe Asn He Pro He Glu Asp Lys Pro Gin Gin Phe Tyr 
865 870 875 880 



Trp Asn Ser His Gly Pro Trp Gin Ala Cys Ser Lys Pro Cys Gin Gly 
885 890 895 



Glu Arg Lys Arg Lys Leu Val Cys Thr Arg Glu Ser Asp Gin Leu Thr 
900 905 910 



Val Ser Asp Gin Arg Cys Asp Arg Leu Pro Gin Pro Gly His He Thr 
915 920 925 



Glu pro Cys Gly Thr Asp Cys Asp Leu Arg Trp His Val Ala Ser Arg 
930 935 940 



Ser Glu Cys Ser Ala Gin Cys Gly Leu Gly Tyr Arg Thr Leu Asp He 
945 950 955 960 



\ 



Tyr Cys Ala Lys Tyr Ser Arg Leu Asp Gly Lys Thr Glu Lys Val Asp 
965 970 975 



Asp Gly Phe Cys Ser 
980 



Ser His Pro Lys Pro Ser Asn Arg Glu Lys Cys 
985 990 
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Ser Gly Glu Cys Asn Thr Gly Gly Trp Arg Tyr Ser Ala Trp Thr Glu 
995 1000 1005 



Cys Ser Lys Ser Cys Asp Gly Gly Thr Gin Arg Arg Arg Ala lie 
1010 1015 1020 



Cys Val Asn Thr Arg Asn Asp Val Leu Asp Asp Ser Lys Cys Thr 
1025 1030 1035 



His Gin Glu Lys Val Thr He Gin Arg Cys Ser Glu Phe Pro Cys 
1040 1045 1050 



Pro Gin Trp Lys Ser Gly Asp Trp Ser Glu Val Arg Trp Glu Gly 
1055 1060 1065 



Cys Tyr Phe Pro 
1070 



<210> 61 

<211> 1356 

<212> PRT 

<213> Homo Sapiens 

<400> 61 

Met Gin Ser Lys Val Leu Leu Ala Val Ala Leu Trp Leu Cys Val Glu 
15 10 15 



Thr Arg Ala Ala Ser Val Gly Leu Pro Ser Val Ser Leu Asp Leu Pro 
20 25 30 



Arg Leu Ser He Gin Lys Asp He Leu Thr He Lys Ala Asn Thr Thr 
35 40 45 



Leu Gin lie Thr Cys Arg Gly Gin Arg Asp Leu Asp Trp Leu Trp Pro 
50 55 60 



Asn Asn Gin Ser Gly Ser Glu Gin Arg Val Glu Val Thr Glu Cys Ser 
65 70 75 80 



Asp Gly Leu Phe Cys Lys Thr Leu Thr He Pro Lys Val He Gly Asn 
85 90 95 



Asp Thr Gly Ala Tyr Lys Cys Phe Tyr Arg Glu Thr Asp Leu Ala Ser 
100 105 110 
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Val He Tyr Val Tyr Val Gin Asp Tyr Arg Ser Pro Phe He Ala Ser 
115 120 " 125 



Val Ser Asp Gin His Gly Val Val Tyr He Thr Glu Asn Lys Asn Lys 
130 135 140 



Thr Val Val He Pro Cys Leu Gly Ser lie Ser Asn lieu Asn Val Ser 
145 150 155 160 



Leu Cys Ala Arg Tyr Pro Glu Lys Arg Phe Val Pro Asp Gly Asn Arg 
165 170 175 



He Ser Trp Asp Ser Lys Lys Gly Phe Thr He Pro Ser Tyr Met He 
180 185 190 



Ser Tyr Ala Gly Met Val Phe Cys Glu Ala Lys He Asn Asp Glu Ser 
195 200 205 



Tyr Gin Ser lie Met Tyr lie Val Val Val Val Gly Tyr Arg He Tyr 
210 215 220 



Asp Val Val Leu Ser Pro Ser His Gly lie Glu Leu Ser Val Gly Glu 
225 230 235 240 



Lys Leu Val Leu Asn Cys Thr Ala Arg Thr Glu Leu Asn Val Gly lie 
245 250 255 



Asp Phe Asn Trp Glu Tyr Pro Ser Ser Lys His Gin His Lys Lys Leu 
260 265 270 



Val Asn Arg Asp Leu Lys Thr Gin Ser Gly Ser Glu Met Lys Lys Phe 
275 280 285 



Leu Ser Thr Leu Thr He Asp Gly Val Thr Arg Ser Asp Gin Gly Leu 
290 295 300 



Tyr Thr Cys Ala Ala Ser Ser Gly Leu Met Thr Lys Lys Asn Ser Thr 
305 310 315 320 



Phe Val Arg Val His Glu Lys Pro Phe Val Ala Phe Gly Ser Gly Met 
325 330 335 



Glu Ser Leu Val Glu Ala Thr Val Gly Glu Arg Val Arg He Pro Ala 
340 345 350 



Lys Tyr Leu Gly Tyr Pro Pro Pro Glu He Lys Trp Tyr Lys Asn Gly 
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355 360 365 



lie Pro Leu Glu Ser Asn His Thr lie Lys Ala Gly His Val Leu Thr 
370 375 380 



lie Met Glu Val Ser Glu Arg Asp Thr Gly Asn Tyr Thr Val lie Leu 
385 390 395 400 



Thr Asn Pro lie Ser Lys Glu Lys Gin Ser His Val Val Ser Leu Val 
405 410 415 



Val Tyr Val Pro Pro Gin He Gly Glu Lys Ser Leu He Ser Pro Val 
420 425 430 



Asp Ser Tyr Gin Tyr Gly Thr Thr Gin Thr Leu Thr Cys Thr Val Tyr 
435 440 445 



Ala He Pro Pro Pro His His He His Trp Tyr Trp Gin Leu Glu Glu 
450 455 460 



Glu Cys Ala Asn Glu Pro Ser Gin Ala Val Ser Val Thr Asn Pro Tyr 
465 470 475 480 



Pro Cys Glu Glu Trp Arg Ser Val Glu Asp Phe Gin Gly Gly Asn Lys 
485 490 495 



He Glu Val Asn Lys Asn Gin Phe Ala Leu He Glu Gly Lys Asn Lys 
500 505 510 



Thr Val Ser Thr Leu Val He Gin Ala Ala Asn Val Ser Ala Leu Tyr 
515 520 525 



Lys Cys Glu Ala Val Asn Lys Val Gly Arg Gly Glu Arg Val He Ser 
530 535 540 ~ . 



Phe His Val Thr Arg Gly Pro Glu He Thr Leu Gin Pro Asp Met Gin 
545 550 555 560 



Pro Thr Glu Gin Glu Ser Val Ser Leu Trp Cys Thr Ala Asp Arg Ser 
565 570 " 575 



Thr Phe Glu Asn Leu Thr Trp Tyr Lys Leu Gly Pro Gin Pro Leu Pro 
580 585 590 



He His Val Gly Glu Leu Pro Thr Pro Val Cys Lys Asn Leu Asp Thr 
595 600 605 
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Leu Trp Lys Leu Asn Ala Thr Met Phe Ser Asn Ser Thr Asn Asp He 
610 615 620 



Leu lie Met Glu Leu Lys Asn Ala Ser Leu Gin Asp Gin Gly Asp Tyr 
625 630 635 " 640 



Val Cys Leu Ala Gin Asp Arg Lys Thr Lys Lys Arg His Cys Val Val 
645 650 655 



Arg Gin Leu Thr Val Leu Glu Arg Val Ala Pro Thr He Thr Gly Asn 
660 665 670 



Leu Glu Asn Gin Thr Thr Ser He Gly Glu Ser He Glu Val Ser Cys 
675 680 685 



Thr Ala Ser Gly Asn Pro Pro Pro Gin lie Met Trp Phe Lys Asp Asn 
690 695 700 



Glu Thr Leu Val Glu Asp Ser Gly He Val Leu Lys Asp Gly Asn Arg 
705 710 715 720 



Asn Leu Thr He Arg Arg Val Arg Lys Glu Asp Glu Gly Leu Tyr Thr 
725 730 735 



Cys Gin Ala Cys Ser Val Leu Gly Cys Ala Lys Val Glu Ala Phe Phe 
740 745 750 



He lie Glu Gly Ala Gin Glu Lys Thr Asn Leu Glu lie He He Leu 
755 760 765 



Val Gly Thr Ala Val He Ala Met Phe Phe Trp Leu Leu Leu Val He 
770 775 780 



He Leu Arg Thr Val Lys Arg Ala Asn Gly Gly Glu Leu Lys Thr Gly 
785 790 795 ^ 800 



Tyr Leu Ser He Val Met Asp Pro Asp Glu Leu Pro Leu Asp Glu His 
805 810 8X5 



Cys Glu Arg Leu Pro Tyr Asp Ala Ser Lys Trp Glu Phe Pro Arg Asp 
820 825 830 



Arg Leu Lys Leu Gly Lys Pro Leu Gly Arg Gly Ala Phe Gly Gin Val 
835 840 ^ 845 
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He Glu Ala Asp Ala Phe Gly He Asp Lys Thr Ala Thr Cys Arg Thr 
850 855 860 



Val Ala Val Lys Met Leu Lys Glu Gly Ala Thr His Ser Glu His Arg 
865 870 875 880 



Ala Leu Met Ser Glu Leu Lys He Leu He His He Gly His His Leu 
885 890 ' 895 



Asn Val Val Asn Leu Leu Gly Ala Cys Thr Lys Pro Gly Gly Pro Leu 
900 905 910 



Met Val He Val Glu Phe Cys Lys Phe Gly Asn Leu Ser Thr Tyr Leu 
915 920 925 



Arg Ser Lys Arg Asn Glu Phe Val Pro Tyr Lys Thr Lys Gly Ala Arg 
930 935 940 



Phe Arg Gin Gly Lys Asp Tyr Val Gly Ala He Pro Val Asp Leu Lys 
945 950 955 ~ 960 



Arg Arg Leu Asp Ser He Thr Ser Ser Gin Ser Ser Ala Ser Ser Gly 
965 970 975 



Phe Val Glu Glu Lys Ser Leu Ser Asp Val Glu Glu Glu Glu Ala Pro 
980 985 990 



Glu Asp Leu Tyr Lys Asp Phe Leu Thr Leu Glu His Leu He Cys Tyr 
995 1000 1005 



Ser Phe Gin Val Ala Lys Gly Met Glu Phe Leu Ala Ser Arg Lys 
1010 1015 1020 



Cys He His Arg Asp Leu Ala Ala Arg Asn He Leu Leu Ser Glu 
1025 1030 1035 



Lys Asn Val Val Lys He Cys Asp Phe Gly Leu Ala Arg Asp He 
1040 1045 1050 



Tyr Lys Asp Pro Asp Tyr Val Arg Lys Gly Asp Ala Arg Leu Pro 
1055 1060 1065 



Leu Lys Trp Met Ala Pro Glu Thr He Phe Asp Arg Val Tyr Thr 
1070 1075 1080 
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lie Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu lie 
1085 1090 1095 

Phe Ser Leu Gly Ala Ser Pro Tyr Pro Gly Val Lys He Asp Glu 
1100 H05 111° 

Glu Phe Cys Arg Arg Leu Lys Glu Gly Thr Arg Met Arg Ala Pro 
1115 1120 ll 25 

Asp Tyr Thr Thr Pro Glu Met Tyr Gin Thr Met Leu Asp Cys Trp 
1130 H35 H40 

His Gly Glu Pro Ser Gin Arg Pro Thr Phe Ser Glu Leu Val Glu 
1145 H50 H55 

His Leu Gly Asn Leu Leu Gin Ala Asn Ala Gin Gin Asp Gly Lys 
1160 H65 H70 

Asp Tyr He Val Leu Pro He Ser Glu Thr Leu Ser Met Glu Glu 
. 1175 H80 H85 

Asp Ser Gly Leu Ser Leu Pro Thr Ser Pro Val Ser Cys Met Glu 
1190 H95 1200 

Glu Glu Glu Val Cys Asp Pro Lys Phe His Tyr Asp Asn Thr Ala 
1205 1210 1215 

Gly He Ser Gin Tyr Leu Gin Asn Ser Lys Arg Lys Ser Arg Pro 
1220 1225 1230 

Val Ser Val Lys Thr Phe Glu Asp He Pro Leu Glu Glu Pro Glu 
1235 1240 1245 

val Lys val He Pro Asp Asp Asn Gin Thr Asp Ser Gly Met Val 
1250 1255 1260 

Leu Ala Ser Glu Glu Leu Lys Thr Leu Glu Asp Arg Thr Lys Leu 
1265 1270 1275 

Ser Pro Ser Phe Gly Gly Met Val Pro Ser Lys Ser Arg Glu Ser 
1280 1285 1290 

Val Ala Ser Glu Gly Ser Asn Gin Thr Ser Gly Tyr Gin Ser Gly 
1295 1300 1305 



Tyr His Ser Asp Asp Thr Asp Thr Thr Val Tyr Ser Ser Glu Glu 
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1310 1315 1320 

Ala Glu Leu Leu Lys Leu lie Glu lie Gly Val Gin Thr Gly Ser 
1325 1330 1335 

Thr Ala Gin lie Leu Gin Pro Asp Ser Gly Thr Thr Leu Ser Ser 
1340 1345 1350 



Pro Pro Val 
1355 



<210> 62 

<211> 468 

<212> PRT 

<213> Homo sapiens 

<400> 62 

Met Gly Arg Gly Trp Gly Phe Leu Phe Gly Leu Leu Gly Ala Val Trp 

1 * 5 10 15 



Leu Leu Ser Ser Gly His Gly Glu Glu Gin Pro Pro Glu Thr Ala Ala 
20 25 30 



Gin Arg Cys Phe Cys Gin Val Ser Gly Tyr Leu Asp Asp Cys Thr Cys 
35 40 45 



Asp Val Glu Thr lie Asp Arg Phe Asn Asn Tyr Arg Leu Phe Pro Arg 
50 55 60 



Leu Gin Lys Leu Leu Glu Ser Asp Tyr Phe Arg Tyr Tyr Lys Val Asn 
65 70 75 80 



Leu Lys Arg Pro Cys Pro Phe Trp Asn Asp lie Ser Gin Cys Gly Arg 
85 90 95 



Arg Asp Cys Ala Val Lys Pro Cys Gin Ser Asp Glu Val Pro Asp Gly 
100 1 105 HO 



lie Lys Ser Ala Ser Tyr Lys Tyr Ser Glu Glu Ala Asn Asn Leu lie 
115 * 120 125 



Glu Glu Cys Glu Gin Ala Glu Arg Leu Gly Ala Val Asp Glu Ser Leu 
130 135 140 



Ser Glu Glu Thr Gin Lys Ala Val Leu Gin Trp Thr Lys His Asp Asp 
145 150 155 160 
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Ser Ser Asp Asn Phe Cys Glu Ala Asp Asp lie Gin Ser Pro Glu Ala 
165 170 175 



Glu Tyr Val Asp Leu Leu Leu Asn Pro Glu Arg Tyr Thr Gly Tyr Lys 
180 185 190 



Gly Pro Asp Ala Trp Lys lie Trp Asn Val lie Tyr Glu Glu Asn Cys 
195 200 205 



Phe Lys Pro Gin Thr lie Lys Arg Pro Leu Asn Pro Leu Ala Ser Gly 
210 215 220 



Gin Gly Thr Ser Glu Glu Asn Thr Phe Tyr Ser Trp Leu Glu Gly Leu 
225 230 235 240 



Cys Val Glu Lys Arg Ala Phe Tyr Arg Leu He Ser Gly Leu His Ala 
245 250 255 



Ser He Asn Val His Leu Ser Ala Arg Tyr Leu Leu Gin Glu Thr Trp 
260 265 270 



Leu Glu Lys Lys Trp Gly His Asn He Thr Glu Phe Gin Gin Arg Phe 
275 280 285 



Asp Gly He Leu Thr Glu Gly Glu Gly Pro Arg Arg Leu Lys Asn Leu 
290 295 300 



Tyr Phe Leu Tyr Leu He Glu Leu Arg Ala Leu Ser Lys Val Leu Pro 
305 310 315 320 



Phe Phe Glu Arg Pro Asp Phe Gin Leu Phe Thr Gly Asn Lys He Gin 
325 330 335 



Asp Glu Glu Asn Lys Met Leu Leu Leu Glu He Leu His Glu He Lys 
340 345 350 



Ser Phe Pro Leu His Phe Asp Glu Asn Ser Phe Phe Ala Gly Asp Lys 
355 360 365 



Lys Glu Ala His Lys Leu Lys Glu Asp Phe Arg Leu His Phe Arg Asn 
370 375 380 



He Ser Arg He Met Asp Cys Val Gly Cys Phe Lys Cys Arg Leu Trp 
385 390 395 400 
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Gly Lys Leu Gin Thr Gin Gly Leu Gly Thr Ala Leu Lys lie Leu Phe 
405 410 415 



Ser Glu Lys Leu lie Ala Asn Met Pro Glu Ser Gly Pro Ser Tyr Glu 
420 425 430 



Phe His Leu Thr Arg Gin Glu lie Val Ser Leu Phe Asn Ala Phe Gly 
435 440 445 



Arg He Ser Thr Ser Val Lys Glu Leu Glu Asn Phe Arg Asn Leu Leu 
450 455 460 



Gin Asn He His 
465 



<210> 63 

<211> 228 

<212> PRT 

<213> Homo Sapiens 

<400> 63 

Met Gin Pro Arg Arg Gin Arg Leu Pro Ala Pro Trp Ser Gly Pro Arg 
1 5 10 15 



Gly Pro Arg Pro Thr Ala Pro Leu Leu Ala Leu Leu Leu Leu Leu Ala 
20 25 30 



Pro Val Ala Ala Pro Ala Gly Ser Gly Gly Pro Asp Asp Pro Gly Gin 
35 40 45 



Pro Gin Asp Ala Gly Val Pro Arg Arg Leu Leu Gin Gin Lys Ala Arg 
50 55 60 



Ala Ala Leu His Phe Phe Asn Phe Arg Ser Gly Ser Pro Ser Ala Leu 
65 70 75 80 



Arg Val Leu Ala Glu Val Gin Glu Gly Arg Ala Trp He Asn Pro Lys 
85 90 95 



Glu Gly Cys Lys Val His Val Val Phe Ser Thr Glu Arg Tyr Asn Pro 
100 105 110 



Glu Ser Leu Leu Gin Glu Gly Glu Gly Arg Leu Gly Lys Cys Ser Ala 
115 120 125 



Arg Val Phe Phe Lys Asn Gin Lys Pro Arg Pro Thr He Asn Val Thr 
130 135 140 . 
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Cys Thr Arg Leu He Glu Lye Lys Lys Arg Gin Gin Glu Asp Tyr Leu 
145 150 155 160 

Leu Tyr Lys Gin Met Lys Gin Leu Lys Asn Pro Leu Glu He Val Ser 
165 170 175 

He Pro Asp Asn His Gly His He Asp Pro Ser Leu Arg Leu He Trp 
180 i 8 5 lgo 

Asp Leu Ala Phe Leu Gly Ser Ser Tyr Val Met Trp Glu Met Thr Thr 
195 200 205 

Gin Val Ser His Tyr Tyr Leu Ala Gin Leu Thr Ser Val Arg Gin Trp 
210 215 22 0 

Val Arg Lys Thr 
225 



<210> 64 

<211> 747 

<212> PRT 

<213> Homo Sapiens 

<400> 64 

Met Arg Arg Cys Asn Ser Gly Ser Gly Pro Pro Pro Ser Leu Leu Leu 
15 10 15 

Leu Leu Leu Trp Leu Leu Ala Val Pro Gly Ala Asn Ala Ala Pro Arq 
20 25 30 

Ser Ala Leu Tyr Ser Pro Ser Asp Pro Leu Thr Leu Leu Gin Ala Asp 
35 40 45 

Thr Val Arg Gly Ala Val Leu Gly Ser Arg Ser Ala Trp Ala Val Glu 
50 55 60 



Phe Phe Ala Ser Trp Cys Gly His Cys He Ala Phe Ala Pro Thr Tro 
65 70 75 60 

Lys Ala Leu Ala Glu Asp Val Lys Ala Trp Arg Pro Ala Leu Tyr Leu 
85 90 95 

Ala Ala Leu Asp Cys Ala Glu Glu Thr Asn Ser Ala Val Cys Arg Asp 
100 105 no 



WO 2004/073657 



128/282 



PCT/US2004/005455 



Phe Asn lie Pro Gly Phe Pro Thr Val Arg Phe Phe Lys Ala Phe Thr 
115 120 125 



Lys Asn Gly Ser Gly Ala Val Phe Pro Val Ala Gly Ala Asp Val Gin 
130 135 140 



Thr Leu Arg Glu Arg Leu lie Asp Ala Leu Glu. Ser His His Asp Thr 
145 150 155 160 



Trp Pro Pro Ala Cys Pro Pro Leu Glu Pro Ala Lys Leu Glu Glu lie 
165 170 175 



Asp Gly Phe Phe Ala Arg Asn Asn Glu Glu Tyr Leu Ala Leu He Phe 
180 185 190 



Glu Lya Gly Gly Ser Tyr Leu Gly Arg Glu Val Ala Leu Asp Leu Ser 
195 200 205 



Gin His Lys Gly Val Ala Val Arg Arg Val Leu Asn Thr Glu Ala Asn 
210 215 220 



Val Val Arg Lys Phe Gly Val Thr Asp Phe Pro Ser Cys Tyr Leu Leu 
225 230 235 240 



Phe Arg Asn Gly Ser Val Ser Arg Val Pro Val Leu Met Glu Ser Arg 
245 250 255 



Ser Phe Tyr Thr Ala Tyr Leu Gin Arg Leu Ser Gly Leu Thr Arg Glu 
260 265 270 



Ala Ala Gin Thr Thr Val Ala Pro Thr Thr Ala Asn Lys He Ala Pro 
275 280 285 



Thr Val Trp Lys Leu Ala Asp Arg Ser Lys He Tyr Met Ala Asp Leu 
290 295 300 



Glu Ser Ala Leu His Tyr He Leu Arg lie Glu Val Gly Arg Phe Pro 
305 310 315 ' 320 



Val Leu Glu Gly Gin Arg Leu Val Ala Leu Lys Lys Phe Val Ala Val 
325 330 335 



Leu Ala Lys Tyr Phe Pro Gly Arg Pro Leu Val Gin Asn Phe Leu His 
340 345 350 



Ser Val Asn Glu Trp Leu Lys Arg Gin Lys Arg Asn Lys He Pro Tyr 
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355 



360 



365 



Ser Phe Phe Lys Thr Ala Leu Asp Asp Arg Lys Glu Gly Ala Val Leu 
370 375 380 



Ala Lys Lys Val Asn Trp lie Gly Cys Gin Gly Ser Glu Pro His Phe 
385 390 395 400 



Arg Gly Phe Pro Cys Ser Leu Trp Val Leu phe His Phe Leu Thr Val 
405 410 415 



Gin Ala Ala Arg Gin Asn Val Asp His Ser Gin Glu Ala Ala Lys Ala 
420 425 430 



Lys Glu Val Leu Pro Ala lie Arg Gly Tyr Val His Tyr Phe Phe Gly 
435 440 445 



Cys Arg Asp Cys Ala Ser His Phe Glu Gin Met Ala Ala Ala Ser Met 
450 455 460 



His Arg Val Gly Ser Pro Asn Ala Ala Val Leu Trp Leu Trp Ser Ser 
465 470 475 480 



His Asn Arg Val Asn Ala Arg Leu Ala Gly Ala Pro Ser Glu Asp Pro 
485 490 495 



Gin Phe Pro Lys Val Gin Trp Pro Pro Arg Glu Leu Cys Ser Ala Cys 
500 505 510 



His Asn Glu Arg Leu Asp Val Pro Val Trp Asp Val Glu Ala Thr Leu 
515 520 525 



Asn Phe Leu Lys Ala His Phe Ser Pro Ser Asn He He Leu Asp Phe 
530 535 540 



Pro Ala Ala Gly Ser Ala Ala Arg Arg Asp Val Gin Asn Val Ala Ala 
545 550 555 560 



Ala Pro Glu Leu Ala Met Gly Ala Leu Glu Leu Glu Ser Arg Asn Ser 
565 570 575 



Thr Leu Asp Pro Gly Lys Pro Glu Met Met Lys Ser Pro Thr Asn Thr 
580 585 590 



Thr Pro His Val Pro Ala Glu Gly Pro Glu Ala Ser Arg Pro Pro Lys 
595 600 605 
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Leu His Pro Gly Leu Arg Ala Ala Pro Gly Gin Glu Pro Pro Glu His 
610 615 620 



Met Ala Glu Leu Gin Arg Asn Glu Gin Glu Gin Pro Leu Gly Gin Trp 
625 630 635 640 



His Leu Ser Lys Arg Asp Thr Gly Ala Ala Leu Leu Ala Glu Ser Arg 
645 650 655 



Ala Glu Lys Asn Arg Leu Trp Gly Pro Leu Glu Val Arg Arg Val Gly 
660 665 670 



Arg Ser Ser Lys Gin Leu Val Asp lie Pro Glu Gly Gin Leu Glu Ala 
675 680 685 



Arg Ala Gly Arg Gly Arg Gly Gin Trp Leu Gin Val Leu Gly Gly Gly 
690 695 700 



Phe Ser Tyr Leu Asp lie Ser Leu Cys Val Gly Leu Tyr Ser Leu Ser 
705 710 715 720 



Phe Met Gly Leu Leu Ala Met Tyr Thr Tyr Phe Gin Ala Lys He Arg 
725 730 735 



Ala Leu Lys Gly His Ala Gly His Pro Ala Ala 
740 745 



<210> 65 

<211> 1163 

<212> PRT 

<213> Homo Sapiens 

<400> 65 

Met Val Trp Cys Leu Gly Leu Ala Val Leu Ser Leu Val He Ser Gin 
1 5 10 15 



Gly Ala Asp Gly Arg Gly Lys Pro Glu Val Val Ser Val Val Gly Arg 
20 25 30 



Ala Glu Glu Ser Val Val Leu Gly Cys Asp Leu Leu Pro Pro Ala Gly 
35 40 45 



Arg Pro Pro Leu Hie Val He Glu Trp Leu Arg Phe Gly Phe Leu Leu 
50 55 60 
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Pro lie Phe lie Gin Phe Gly Leu Tyr Ser Pro Arg lie Asp Pro Asp 
65 70 75 80 



Tyr Val Gly Arg Val Arg Leu Gin Lys Gly Ala Ser Leu Gin lie Glu 
85 90 95 



Gly Leu Arg Val Glu Asp Gin Gly Trp Tyr Glu Cys Arg Val Phe Phe 
100 105 110 



Leu Asp Gin His lie Pro Glu Asp Asp Phe Ala Asn Gly Ser Trp Val 
115 120 125 



His Leu Thr Val Asn Ser Pro Pro Gin Phe Gin Glu Thr Pro Pro Ala 
130 135 140 



Val Leu Glu Val Gin Glu Leu Glu Pro Val Thr Leu Arg Cys Val Ala 
145 150 155 ~ ~ 160 



Arg Gly Ser Pro Leu Pro His Val Thr Trp Lys Leu Arg Gly Lys Asp 
165 170 ' ~ 175 



Leu Gly Gin Gly Gin Gly Gin Val Gin Val Gin Asn Gly Thr Leu Arg 
180 185 190 



lie Arg Arg Val Glu Arg Gly Ser Ser Gly Val Tyr Thr Cys Gin Ala 
195 200 205 



Ser Ser Thr Glu Gly Ser Ala Thr His Ala Thr Gin Leu Leu Val Leu 
210 215 220 



Gly Pro Pro Val lie Val Val Pro Pro Lys Asn Ser Thr Val Asn Ala 
225 230 235 240 



Ser Gin Asp Val Ser Leu Ala Cys His Ala Glu Ala Tyr Pro Ala Asn 
245 250 255 



Leu Thr Tyr Ser Trp Phe Gin Asp Asn lie Asn Val Phe His He Ser 
260 265 270 



Arg Leu Gin Pro Arg Val Gin He Leu Val Asp Gly Ser Leu Arg Leu 
275 280 285 



Leu Ala Thr Gin Pro Asp Asp Ala Gly Cys Tyr Thr Cys Val Pro Ser 
290 295 300 



Asn Gly Leu Leu His Pro Pro Ser Ala Ser Ala Tyr Leu Thr Val Leu 
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305 310 315 320 



Cys Met Pro Gly Val lie Arg Cys Pro Val Arg Ala Asn Pro Pro Leu 
325 330 335 



Leu Phe' Val Ser Trp Thr Lys Asp Gly Lys Ala Leu Gin Leu Asp Lys 
340 345 35.0 



Phe Pro Gly Trp Ser Gin Gly Thr Glu Gly Ser Leu lie He Ala Leu 
355 360 365 



Gly Asn Glu Asp Ala Leu Gly Glu Tyr Ser Cys Thr Pro Tyr Asn Ser 
370 375 380 



Leu Gly Thr Ala Gly Pro Ser Pro Val Thr Arg Val Leu Leu Lys Ala 
385 390 395 400 



Pro Pro Ala Phe He Glu Arg Pro Lys Glu Glu Tyr Phe Gin Glu Val 
405 410 415 



Gly Arg Glu Leu Leu He Pro Cys Ser Ala Gin Gly Asp Pro Pro Pro 
420 425 430 



Val Val Ser Trp Thr Lys Val Gly Arg Gly Leu Gin Gly Gin Ala Gin 
435 440 445 



Val Asp Ser Asn Ser Ser Leu He Leu Arg Pro Leu Thr Lys Glu Ala 
450 455 460 



His Gly His Trp Glu Cys Ser Ala Ser Asn Ala Val Ala Arg Val Ala 
465 470 475 480 



Thr Ser Thr Asn Val Tyr Val Leu Gly Thr Ser Pro His Val Val Thr 
485 490 495 



Asn Val Ser Val Val Ala Leu Pro Lys Gly Ala Asn Val Ser Trp Glu 
500 505 510 



Pro Gly Phe Asp Gly Gly Tyr Leu Gin Arg Phe Ser Val Trp Tyr Thr 
515 520 525 



Pro Leu Ala Lys Arg Pro Asp Arg Met His His Asp Trp Val Ser Leu 
530 535 540 



Ala Val Pro Val Gly Ala Ala His Leu Leu Val Pro Gly Leu Gin Pro 
545 550 555 560 
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His Thr Gin Tyr Gin Phe Ser Val Leu Ala Gin Asn Lys Leu Gly Ser 
565 570 575 



Gly Pro Phe Ser Glu lie Val Leu Ser Ala Pro Glu Gly Leu Pro Thr 
580 585 590 



Thr Pro Ala Ala Pro Gly Leu Pro Pro Thr Glu lie Pro Pro Pro Leu 
595 600 605 



Ser Pro Pro Arg Gly Leu Val Ala Val Arg Thr Pro Arg Gly Val Leu 
610 615 620 



Leu His Trp Asp Pro Pro Glu Leu Val Pro Lys Arg Leu Asp Gly Tyr 
625 630 635 640 



Val Leu Glu Gly Arg Gin Gly Ser Gin Gly Trp Glu Val Leu Asp Pro 
645 650 655 



Ala Val Ala Gly Thr Glu Thr Glu Leu Leu Val Pro Gly Leu lie Lys 
660 665 670 



Asp Val Leu Tyr Glu Phe Arg Leu Val Ala Phe Ala Gly Ser Phe Val 
675 680 685 



Ser Asp Pro Ser Asn Thr Ala Asn Val Ser Thr Ser Gly Leu Glu Val 
690 695 700 



Tyr Pro Ser Arg Thr Gin Leu Pro Gly Leu Leu Pro Gin Pro Val Leu 
705 710 715 720 



Ala Gly Val Val Gly Gly Val Cys Phe Leu Gly Val Ala Val Leu Val 
725 730 735 



Ser He Leu Ala Gly Cys Leu Leu Asn Arg Arg Arg Ala Ala Arg Arg 
74 0 745 750 



Arg Arg Lys Arg Leu Arg Gin Asp Pro Pro Leu He Phe Ser Pro Thr 
755 760 765 



Gly Lys Ser Ala Ala Pro Ser Ala Leu Gly Ser Gly Ser Pro Asp Ser 
770 775 780 



Val Ala Lys Leu Lys Leu Gin Gly Ser Pro Val Pro Ser Leu Arg Gin 
785 790 795 800 
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Ser Leu Leu Trp Gly Asp Pro Ala GIy Thr Pro Ser PrQ His prQ 
5 810 815 

Pro Pro Ser Ser Arg Gly Pro Leu Pro Leu ^ prQ ^ ^ ^ 

825 830 

Pro Asp Gly Arg Phe Val Met Gly Pro Thr Val Ala Ala Pro Gin Glu 

840 845 

Arg Ser Gly Arg Glu Gin Ala Glu Pro Arg Thr Pro Ala Gin Arg Leu 

B55 860 

S Ser S6r Ser *«> Ser Gly Ala Pro Gin 

870 875 880 

Pro Leu Cys lie Glu Asp He Ser Pro Val Ala Pro Pro Pro Ala Ala 
885 890 895 

Pro Pro ser Pro Le u Pro Gly Pro Gly Pro Leu Leu Gin Tyr Leu Ser 



905 



910 



Leu Pro Phe Phe Arg Glu Met Asn Val acn m * 

915 JJJ Asp Gly As P Tr P Pr o Pro Leu 



925 



Olu Glu Pro Ser P ro A la Ala Pro Pro Asp Tyr Met Asp Thr Arg Arg 

935 94Q 

Cys Pro Thr Ser Ser Phe Leu Arg Ser Pro Glu Thr Pro Pro Val Ser 

950 960 

Pro Arg Glu Ser Leu Pro Gly Ala Val Val Gly Ala Gly Ala Thr Ala 

970 975 

Glu Pro Pro Tyr Thr Ala Leu Ala Asp Trp Thr Leu Arg Glu Arg Leu 

985 99Q 

Leu Pro Gly Leu Leu Pro Ala Ala Pro Arg Gly Ser Leu Thr Ser Gin 



1000 



1005 



Ser Bar Gly Arg Gly Ser Ala Ser Phe Leu Arg Pro Pro Ser Thr 

1015 1020 

Ala Pro ser Ala Gly Gly jer Tyr Leu Ser Pro Ala Pro Gly Asp 

1030 1Q35 



WO 2004/073657 PCT/US2004/005455 

135/282 

Thr Ser Ser Trp Ala Ser Gly Pro Glu Arg Trp Pro Arg Arg Glu 

1040 1045 ~ 1050 



His Val Val Thr Val Ser Lys Arg Arg Asn Thr Ser Val Asp Glu 
1055 1060 1065 



Asn Tyr Glu Trp Asp Ser Glu Phe Pro Gly Asp. Met Glu Leu Leu 
1070 1075 1080 



Glu Thr Leu His Leu Gly Leu Ala Ser Ser Arg Leu Arg Pro Glu 
1085 1090 1095 



Ala Glu Thr Glu Leu Gly Val Lys Thr Pro Glu Glu Gly Cys Leu 
1100 1105 1110 



Leu Asn Thr Ala His Val Thr Gly Pro Glu Ala Arg Cys Ala Ala 
1115 1120 1125 



Leu Arg Glu Glu Phe Leu Ala Phe Arg Arg Arg Arg Asp Ala Thr 
1130 1135 1140 



Arg Ala Arg Leu Pro Ala Tyr Arg Gin Pro Val Pro His Pro Glu 
1145 1150 1155 



Gin Ala Thr Leu Leu 
1160 



<210> 66 

<211> 87 

<212> PRT 

<213> Homo Sapiens 

<400> 66 

Met Ala Gly Ala Ser Leu Gly Ala Arg Phe Tyr Arg Gin lie Lys Arg 
15 10 15 



His Pro Gly lie lie Pro Met lie Gly Leu lie Cys Leu Gly Met Gly 
20 25 30 



Ser Ala Ala Leu Tyr Leu Leu Arg Leu Ala Leu Arg Ser Pro Asp Val 
35 40 45 



Cys Trp Asp Arg Lys Asn Asn Pro Glu Pro Trp Asn Arg Leu Ser Pro 
50 55 60 



Asn Asp Gin Tyr Lys Phe Leu Ala Val Ser Thr Asp Tyr Lys Lys Leu 
65 70 75 80 
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Lys Lys Asp Arg Pro Asp Phe 
85 



<210> 67 

<211> 1241 

<212> PRT 

<213> Homo Sapiens 

<4Q0> 67 

Met lie Met Phe Pro Leu Phe Gly Lys lie Ser Leu Gly He Leu lie 
5 10 15 

Phe val Leu lie olu Gly Asp Phe Pro Ser Leu Thr Ala Gin Thr Tvr 
20 .25 30 



Leu Ser lie olu Glu He- Gin Glu Pro Lys Ser Ala Val Ser Phe Leu 
35 40 45 

Leu Pro Glu Glu Ser Thr Asp Leu Ser Leu Ala Thr Lys Lys Lys Gin 



60 



Pro Leu Asp Arg Arg Glu Thr Glu Arg Gin Trp Leu He Arg Arg Arq 
65 70 75 y 80 9 

Arg ser He Leu Phe Pro Asn Gly Val Lys He Cys Pro Asp Glu Ser 
85 90 95 

Val Ala Glu Ala Val Ala Asn His Val Lys Tyr Phe Lys Val Arg Val 

105 i 10 

Cys Gin Glu Ala Val Trp Glu Ala Phe Arg Thr Phe Trp Asp Arg 
115 120 a 



Leu 

125 



Pro Gly Arg Glu Glu Tyr His Tyr Trp Met Asn Leu Cys Glu Asp Gly 

135 140 

Val Thr Ser He Phe Glu Met Gly Thr Asn Phe Ser Glu Ser Val Glu 

150 155 1SQ 

His Arg Ser Leu lie Met Lys Lys Leu Thr Tyr Ala Lys Glu Thr Val 
165 170 175 

Ser Ser Ser Glu Leu Ser Ser Pro Val Pro Val Gly Asp Thr Ser Thr 
180 185 190 
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Leu Gly Asp Thr Thr Leu Ser Val Pro His Pro Glu Val Asp Ala Tyr 
195 200 205 



Glu Gly Ala Ser Glu Ser Ser Leu Glu Arg Pro Glu Glu Ser lie Ser 
210 215 220 



Asn Glu lie Glu Asn Val lie Glu Glu Ala Thr Lys Pro Ala Gly Glu 
225 230 235 240 



Gin lie Ala Glu Phe Ser He His Leu Leu Gly Lys Gin Tyr Arg Glu 
245 250 255 



Glu Leu Gin Asp Ser Ser Ser Phe His His Gin His Leu Glu Glu Glu 
260 265 270 



Phe He Ser Glu Val Glu Asn Ala Phe Thr Gly Leu Pro Gly Tyr Lys 
275 280 285 



Glu He Arg Val Leu Glu Phe Arg Ser Pro Lys Glu Asn Asp Ser Gly 
290 295 300 



Val Asp Val Tyr Tyr Ala Val Thr Phe Asn Gly Glu Ala lie Ser Asn 
305 310 % 315 . 320 



Thr Thr Trp Asp Leu He Ser Leu His Ser Asn Lys Val Glu Asn His 
325 330 335 



Gly Leu Val Glu Leu Asp Asp Lys Pro Thr Val Val Tyr Thr He Ser 
340 345 ^ 350 



Asn Phe Arg Asp Tyr He Ala Glu Thr Leu Gin Gin Asn Phe Leu Leu 
355 360 365 



Gly Asn Ser Ser Leu Asn Pro Asp Pro Asp Ser Leu Gin Leu He Asn 
370 375 380 



Val Arg Gly Val Leu Arg His Gin Thr Glu Asp Leu Val Trp Asn Thr 
385 390 395 * 400 



Gin Ser Ser Ser Leu Gin Ala Thr Pro Ser Ser He Leu Asp Asn Thr 
405 410 415 



Phe Gin Ala Ala Trp Pro Ser Ala Asp Glu Ser He Thr Ser Ser He 
420 425 430 



Pro Pro Leu Asp Phe Ser Ser Gly Pro Pro Ser Ala Thr Gly Arg Glu 
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440 
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445 



Leu Trp Ser Glu Ser Pro Leu Gly Asp Leu Val Ser Thr His Lys Leu 
450 455 460 



Ala Phe Pro Ser Lys Met Gly Leu Ser Ser Ser Pro Glu Val Leu Glu 
465 470 475 480 



Val Ser Ser Leu Thr Leu His Ser Val Thr Pro Ala Val Leu Gin Thr 
485 490 495 



Gly Leu Pro Val Ala Ser Glu Glu Arg Thr Ser Gly Ser His Leu Val 
500 505 510 



Glu Asp Gly Leu Ala Asn Val Glu Glu Ser Glu Asp Phe Leu Ser lie 
515 520 525 



Asp Ser Leu Pro Ser Ser Ser Phe Thr Gin Pro Val Pro Lys Glu Thr 
530 535 540 



He Pro Ser Met Glu Asp Ser Asp Val Ser Leu Thr Ser Ser Pro Tyr 
545 550 555 560 



Leu Thr Ser Ser He Pro Phe Gly Leu Asp Ser Leu Thr Ser Lys Val 
565 570 575 



Lys Asp Gin Leu Lys Val Ser Pro Phe Leu Pro Asp Ala Ser Met Glu 
580 585 590 



Lys Glu Leu He Phe Asp Gly Gly Leu Gly Ser Gly Ser Gly Gin LyB 
595 600 605 



Val Asp Leu He Thr Trp Pro Trp Ser Glu Thr Ser Ser Glu Lys Ser 
610 615 620 



Ala Glu Pro Leu Ser Lys Pro Trp Leu Glu Asp Asp Asp Ser Leu Leu 
625 630 635 640 



Pro Ala Glu He Glu Asp Lys Lys Leu Val Leu Val Asp Lys Met Asp 
645 650 655 



Ser Thr Asp Gin He Ser Lys His Ser Lys Tyr Glu His Asp Asp Arg" 
660 665 670 



Ser Thr His Phe pro Glu Glu Glu Pro Leu Ser Gly Pro Ala Val Pro 
675 680 685 



WO 2004/073657 



139/282 



PCT/US2004/005455 



lie Phe Ala Asp Thr Ala Ala Glu Ser Ala Ser Leu Thr Leu Pro Lys 
690 695 700 



His lie Ser Glu Val Pro Gly Val Asp Asp Cys Ser Val Thr Lys Ala 
705 710 715 720 



Pro Leu lie Leu Thr Ser Val Ala lie Ser Ala Ser Thr Asp Lys Ser 
725 730 735 



Asp Gin Ala Asp Ala lie Leu Arg Glu Asp Met Glu Gin Xle Thr Glu 
740 745 750 



Ser Ser Asn Tyr Glu Trp Phe Asp Ser Glu Val Ser Met Val Lys Pro 
755 760 765 



Asp Met Gin Thr Leu Trp Thr lie Leu Pro Glu Ser Glu Arg Val Trp 
770 775 780 



Thr Arg Thr Ser Ser Leu Glu Lys Leu Ser Arg Asp He Leu Ala Ser 
785 790 795 800 



Thr Pro Gin Ser Ala Asp Arg Leu Trp Leu Ser Val Thr Gin Ser Thr 
805 810 815 



Lys Leu Pro Pro Thr Thr He Ser Thr Leu Leu Glu Asp Glu Val He 
820 825 830 



Met Gly Val Gin Asp He Ser Leu Glu Leu Asp Arg He Gly Thr Asp 
835 840 845 



Tyr Tyr Gin Pro Glu Gin Val Gin Glu Gin Asn Gly Lys Val Gly Ser 
850 855 860 



Tyr Val Glu Met Ser Thr Ser Val His Ser Thr Glu Met Val Ser Val 
865 870 875 880 



Ala Trp Pro Thr Glu Gly Gly Asp Asp Leu Ser Tyr Thr Gin Thr Ser 
885 890 895 



Gly Ala Leu Val Val Phe Phe Ser Leu Arg Val Thr Asn Met Met Phe 
900 905 910 



Ser Glu Asp Leu Phe Asn Lys Asn Ser Leu Glu Tyr Lys Ala Leu Glu 
915 920 925 
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Gin Arg Phe Leu Glu Leu Leu Val Pro Tyr Leu Gin Ser Asn Leu Thr 
930 935 940 



Gly Phe Gin Asn Leu Glu lie Leu Asn Phe Arg Asn Gly Ser lie Val 
945 950 955 960 



Val Asn Ser Arg Met Lys Phe Ala Asn Ser Val Pro Pro Asn Val Asn 
965 970 975 



Asn Ala Val Tyr Met lie Leu Glu Asp Phe Cys Thr Thr Ala Tyr Asn 
980 985 990 



Thr Met Asn Leu Ala He Asp Lys Tyr Ser Leu Asp Val Glu Ser Gly 
995 1000 1005 



Asp Glu Ala Asn Pro Cys Lys Phe Gin Ala Cys Asn Glu Phe Ser' 
1010 1015 1020 



Glu Cys Leu Val Asn Pro Trp Ser Gly Glu Ala Lys Cys Arg Cys 
1025 1030 1035 



Phe Pro Gly Tyr Leu Ser Val Glu. Glu Arg Pro Cys Gin Ser Leu 
1040 1045 1050 



Cys Asp Leu Gin Pro Asp Phe Cys Leu Asn Asp Gly Lys Cys Asp 
1055 1060 1065 



He Met Pro Gly His Gly Ala He Cys Arg Cys Arg Val Gly Glu 
1070 1075 1080 



Asn Trp Trp Tyr Arg Gly Lys His Cys Glu Glu Phe Val Ser Glu 
1085 1090 1095 



Pro Val He lie Gly lie Thr He Ala Ser Val Val Gly Leu Leu 
1100 1105 mo 



Val lie Phe Ser Ala lie lie Tyr Phe Phe He Arg Thr Leu Gin 
1115 1120 H25 



Ala His His Asp Arg Ser Glu Arg Glu Ser Pro Phe Ser Gly Ser 
1130 1135 H40 



Ser Arg Gin Pro Asp Ser Leu Ser Ser He Glu Asn Ala Val Lys 
1145 1150 H55 
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Tyr Asn Pro Val Tyr Glu Ser His Arg Ala Gly Cys Glu Lys Tyr 
1160 H65 H70 

Glu Gly Pro Tyr Pro Gin His Pro Phe Tyr Ser Ser Ala Ser Gly 
1175 H80 1185 

Asp Val He Gly Gly Leu Ser Arg Glu Glu He Arg Gin Met Tyr 
1190 H95 1200 

Glu Ser Ser Glu Leu Ser Arg Glu Glu He Gin Glu Arg Met Arg 
1205 1210 1215 

Val Leu Glu Leu Tyr Ala Asn Asp Pro Glu Phe Ala Ala Phe Val 
1220 1225 1230 

Arg Glu Gin Gin Val Glu Glu Val 
1235 1240 



<210> 68 

<211> 211 

<212> PRT 

<213> Homo Sapiens 

<400> 68 

Met Ala Asn Ala Gly Leu Gin Leu Leu Gly Phe He Leu Ala Phe Leu 
1 5 io 15 



Gly Trp He Gly Ala He Val Ser Thr Ala Leu Pro Gin Trp Arg He 
20 25 30 



Tyr Ser Tyr Ala Gly Asp Asn He Val Thr Ala Gin Ala Met Tyr Glu 
35 40 45 



Gly Leu Trp Met Ser Cys Val Ser Gin Ser Thr Gly Gin He Gin Cys 
50 55 60 



Lys Val Phe Asp Ser Leu Leu Asn Leu Ser Ser Thr Leu Gin Ala Thr 
65 70 75 80 



Arg Ala Leu Met Val Val Gly He Leu Leu Gly Val He Ala He Phe 
85 90 95 



Val Ala Thr Val Gly Met Lys Cys Met Lys Cys Leu Glu Asp Asp Glu 
100 105 no 



Val Gin Lys Met Arg Met Ala Val He Gly Gly Ala He Phe Leu Leu 
115 120 125 
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Ala Gly Leu Ala lie Leu Val Ala Thr Ala Trp Tyr Gly Asn Arg lie 
130 135 140 



Val Gin Glu Phe Tyr Asp Pro Met Thr Pro Val Asn Ala Arg Tyr Glu 
145 150 155 160 



Phe Gly Gin Ala Leu Phe Thr Gly Trp Ala Ala Ala Ser Leu Cys Leu 
165 170 175 



Leu Gly Gly Ala Leu Leu Cys Cys Ser Cys Pro Arg Lys Thr Thr Ser 
180 185 190 



Tyr Pro Thr Pro Arg Pro Tyr Pro Lys Pro Ala Pro Ser Ser Gly Lys 
195 200 205 



Asp Tyr Val 
210 



<210> 69 

<211> 360 

<212> PRT 

<213> Homo Sapiens 

<400> 69 

Met Asp Leu His Leu Phe Asp Tyr Ser Glu Pro Gly Asn Phe Ser Asp 
15 10 15 



He Ser Trp Pro Cys Asn Ser Ser Asp Cys He Val Val Asp Thr Val 
20 25 30 



Met Cys Pro Asn Met Pro Asn Lys Ser Val Leu Leu Tyr Thr Leu Ser 
35 40 45 



Phe He Tyr He Phe He Phe Val He Gly Met He Ala Asn Ser Val 
50 ~ 55 60 



Val Val Trp Val Asn He Gin Ala Lys Thr Thr Gly Tyr Asp Thr His 
65 70 75 80 



Cys Tyr He Leu Asn Leu Ala lie Ala Asp Leu Trp Val Val Leu Thr 
85 90 95 



He Pro Val Trp Val Val Ser Leu Val Gin His Asn Gin Trp Pro Met 
100 105 110 
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Gly Glu Leu Thr Cys Lys Val Thr His Leu He Phe Ser lie Asn Leu 
115 120 125 

Phe Gly Ser He Phe Phe Leu Thr Cys Met Ser Val Asp Arg Tyr Leu 
130 135 140 

Ser He Thr Tyr Phe Thr Asn Thr Pro Ser Ser Arg Lys Lys Met Val 
145 150 155 160 

Arg Arg Val Val Cys He Leu Val Trp Leu Leu Ala Phe Cys Val Ser 
165 170 175 



Leu Pro Asp Thr Tyr Tyr Leu Lys Thr Val Thr Ser Ala Ser Asn Asn 
180 185 190 



Glu Thr Tyr Cys Arg Ser Phe Tyr Pro Glu His Ser He Lys Glu Trp 
195 200 205 



Leu He Gly Met Glu Leu Val Ser Val Val Leu Gly Phe Ala Val Pro 
210 215 220 



Phe Ser He He Ala Val Phe Tyr Phe Leu Leu Ala Arg Ala He Ser 
225 230 235 240 



Ala Ser Ser Asp Gin Glu Lys His Ser Ser Arg Lys He He Phe Ser 
245 250 255 



Tyr Val Val Val Phe Leu Val Cys Trp Leu Pro Tyr His Val Ala Val 
260 265 270 



Leu Leu Asp He Phe Ser He Leu His Tyr He Pro Phe Thr Cys Arg 
275 2B0 285 



Leu Glu His Ala Leu Phe Thr Ala Leu His Val Thr Gin Cys Leu Ser 
. 290 295 300 



Leu Val His Cys Cys Val Asn Pro Val Leu Tyr Ser Phe He Asn Arg 
305 310 315 320 



Asn Tyr Arg Tyr Glu Leu Met Lys Ala Phe He Phe Lys Tyr Ser Ala 
325 330 335 



Lys Thr Gly Leu Thr Lys Leu He Asp Ala Ser Arg Val Ser Glu Thr 
340 345 350 



Glu Tyr Ser Ala Leu Glu Gin Ser 
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355 



360 



<210> 70 

<211> 2273 

<212> PRT 

<213> Homo Sapiens 

<400> 70 



Thr 



Met Gly Phe Val Arg Gln lle Gln Leu Leu Leu Trp ^ ^ 

10 15 

Leu Arg Lys Ar g Gln Lys Ile Arg phe ^ ^ ^ ^ ^ ^ ^ 

25 30 

Leu Ser Leu Phe Leu Val Leu Trp Leu Arg Asn ^ ^ 

Tyr Ser His His Glu cys His Phe Pro Asn Lys Ala Met Pro Ser Ala 

55 60 

Gly Met Leu Pro Trp Leu Gln Gly He Phe Cys Asn Val Asn Asn Pro 

70 75 80 

Cys Phe Gln Ser Pro ^ Pro Gly fla ?er pro Qly ^ ^ ^ ^ 



90 95 



XVr Asn Asn Ser He Leu Ala Arg Val Tyr Arg Asp Phe Gln Glu Leu 



110 



Leu Met Asn Ala Pro Glu Ser Gln His Leu Gly Arg He Trp Thr Glu 

120 125 

Leu His lie Leu Ser Gln Phe Met Asp Thr Leu Arg Thr His Pro Glu 

135 140 

Arg lie Ala Gly Arg Gly He Arg He Arg Asp He Leu Lys Asp Glu 

i55 160 

Glu Thr Leu Thr Leu Phe Leu He Lys Asn He Gly Leu Ser Asp Ser 

170 175 

Val Val Tyr Leu Leu He Asn Ser Gln Val Arg Pro Glu Gln Phe Ala 

185 190 

His Gly Val Pro Asp Leu Ala Leu Lys Asp H e Ala Cys Ser Glu Ala 

200 205 
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Leu Leu Glu Arg Phe lie lie Phe Ser Gin Arg Arg Gly Ala Lys Thr 
210 215 220 



Val Arg Tyr Ala Leu Cys Ser Leu Ser Gin Gly Thr Leu Gin Trp lie 
225 230 235 240 



Glu Asp Thr Leu Tyr Ala Asn Val Asp Phe Phe Lys Leu Phe Arg Val 
245 250 255 



Leu Pro Thr Leu Leu Asp Ser Arg Ser Gin Gly He Asn Leu Arg Ser 
260 265 270 



Trp Gly Gly He Leu Ser Asp Met Ser Pro Arg He Gin Glu Phe lie 
275 280 285 



His Arg Pro Ser Met Gin Asp Leu Leu Trp Val Thr Arg Pro Leu Met 
290 295 300 



Gin Asn Gly Gly Pro Glu Thr Phe Thr Lys Leu Met Gly He Leu Ser 
305 310 315 320 



Asp Leu Leu Cys Gly Tyr Pro Glu Gly Gly Gly Ser Arg Val Leu Ser 
325 330 335 



Phe Asn Trp Tyr Glu Asp Asn Asn Tyr Lys Ala Phe Leu Gly He Asp 
340 345 " 350 



Ser Thr Arg Lys Asp Pro He Tyr Ser Tyr Asp Arg Arg Thr Thr Ser 
355 360 365 



Phe Cys Asn Ala Leu He Gin Ser Leu Glu Ser Asn Pro Leu Thr Lys 
370 375 380 



He Ala Trp Arg Ala Ala Lys Pro Leu Leu Met Gly Lys He Leu Tyr 
385 390 395 400 



Thr Pro Asp ser Pro Ala Ala Arg Arg He Leu Lys Asn Ala Asn Ser 
405 410 415 



Thr Phe Glu Glu Leu Glu His Val Arg Lys Leu Val Lys Ala Trp Glu 
420 425 430 



Glu Val Gly Pro Gin He Trp Tyr Phe Phe Asp Asn Ser Thr Gin Met 
435 440 445 
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Asn Met He Arg Asp Thr Leu Gly Asn Pro Thr Val Lys Asp Phe Leu 
450 455 460 

Asn Arg Gin Leu Gly Glu Glu Gly He Thr Ala Glu Ala He Leu Asn 
465 47 ° 475 480 

Phe Leu Tyr Lys Gly Pro Arg Glu Ser Gin Ala Asp Asp Met Ala Asn 
485 490 495 

Phe Asp Trp Arg Asp He Phe Asn He Thr Asp Arg Thr Leu Arg Leu 
5°° 505 510 

Val Asn Gin Tyr Leu Glu Cys Leu Val Leu Asp Lys Phe Glu Ser Tyr 
515 520 525 

Asn Asp. Glu Thr Gin Leu Thr Gin Arg Ala Leu Ser Leu Leu Glu Glu 
53° 535 • 540 

Asn Met Phe Trp Ala Gly Val Val Phe Pro Asp Met Tyr Pro Trp Thr 
545 550 555 ^ SS0 

Ser Ser Leu Pro Pro His Val Lys Tyr Lys He Arg Met Asp He Asp 
565 570 575 

Val Val Glu Lys Thr Asn Lys He Lys Asp Arg Tyr Trp Asp Ser Gly 
580 585 590 

Pro Arg Ala Asp Pro Val Glu Asp Phe Arg Tyr He Trp Gly Gly Phe 
5 95 600 6 os 

Ala Tyr Leu Gin Asp Met Val Glu Gin Gly He Thr Arg Ser Gin Val 
610 615 620 

Gin Ala Glu Ala Pro Val Gly He Tyr Leu Gin Gin Met Pro Tyr Pro 
625 "0 635 640 

Cys Phe Val Asp Asp Ser Phe Met He He Leu Asn Arg Cys Phe Pro 
6 45 650 655 

He Phe Met Val Leu Ala Trp He Tyr Ser Val Ser Met Thr Val Lys 
660 665 670 

Ser He Val Leu Glu Lys Glu Leu Arg Leu Lys Glu Thr Leu Lys Asn 
67 5 680 685 



Gin Gly Val Ser Asn Ala Val He Trp Cys Thr Trp Phe Leu Asp Ser 
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Phe Ser lie Met Ser Met Ser He Phe Leu Leu Thr He Phe He Met 
705 710 715 720 



His Gly Arg He Leu His Tyr Ser Asp Pro Phe lie Leu Phe Leu Phe 
725 730 735 



Leu Leu Ala Phe Ser Thr Ala Thr He Met Leu Cys Phe Leu Leu Ser 
740 745 750 



Thr Phe Phe Ser Lys Ala Ser Leu Ala Ala Ala Cys Ser Gly Val He 
755 760 765 



Tyr Phe Thr Leu Tyr Leu Pro His He Leu Cys Phe Ala Trp Gin Asp 
? 70 775 780 



Arg Met Thr Ala Glu Leu Lys Lys Ala Val Ser Leu Leu Ser Pro Val 
785 790 795 800 



Ala Phe Gly Phe Gly Thr Glu Tyr Leu Val Arg Phe Glu Glu Gin Gly 
805 810 815 



Leu Gly Leu Gin Trp Ser Asn He Gly Asn Ser Pro Thr Glu Gly Asp 
920 825 830 



Glu Phe Ser Phe Leu Leu Ser Met Gin Met Met Leu Leu Asp Ala Ala 
835 840 845 



Cys Tyr Gly Leu Leu Ala Trp Tyr Leu Asp Gin Val Phe Pro Gly Asp 
85 ° 855 860 



Tyr Gly Thr Pro Leu Pro Trp Tyr Phe Leu Leu Gin Glu Ser Tyr Trp 
865 870 875 



880 



Leu Ser Gly Glu Gly Cys Ser Thr Arg Glu Glu Arg Ala Leu Glu Lys 
B85 890 895 

Thr Glu Pro Leu Thr Glu Glu Thr Glu Asp Pro Glu His Pro Glu Gly 
900 905 910 



He His Asp Ser Phe Phe Glu Arg Glu His Pro Gly Trp Val Pro Gly 
915 920 925 



Val Cys Val Lys Asn Leu Val Lys He Phe Glu Pro Cys Gly Arq Pro 
930 935 940 
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Ala Val Asp Arg Leu Asn He Thr Phe Tyr Glu Asn Gin He Thr Ala 
945 " 950 955 960 



Phe Leu Gly His Asn Gly Ala Gly Lys Thr Thr Thr Leu Ser He Leu 
965 970 975 



Thr Gly Leu Leu Pro Pro Thr Ser Gly Thr Val Leu Val Gly Gly Arg 
980 985 990 



Asp He Glu Thr Ser Leu Asp Ala Val Arg Gin Ser Leu Gly Met Cys 
995 1000 1005 



Pro Gin His Asn He Leu Phe His His Leu Thr Val Ala Glu His 
1010 1015 1020 



Met Leu Phe Tyr Ala Gin Leu Lys Gly Lys Ser Gin Glu Glu Ala 
1025 1030 1035 



Gin Leu Glu Met Glu Ala Met Leu Glu Asp Thr Gly Leu His His 
1040 1045 1050 



Lys Arg Asn Glu Glu Ala Gin Asp Leu Ser Gly Gly Met Gin Arg 
1055 1060 1065 



Lys Leu Ser Val Ala He Ala Phe Val Gly Asp Ala Lys Val Val 
1070 1075 1080 



He Leu Asp Glu Pro Thr Ser Gly Val Asp Pro Tyr Ser Arg Arg 
1085 1090 1095 



Ser He Trp Asp Leu Leu Leu Lys Tyr Arg Ser Gly Arg Thr He 
1100 1105 1110 



He Met Pro Thr His His Met Asp Glu Ala Asp His Gin Gly Asp 
1115 1120 1125 



Arg He Ala He He Ala Gin Gly Arg Leu Tyr Cys Ser Gly Thr 
1130 1135 1140 



Pro Leu Phe Leu Lys Asn Cys Phe Gly Thr Gly Leu Tyr Leu Thr 
1145 1150 1155 



Leu Val Arg Lys Met Lys Asn He Gin Ser Gin Arg Lys Gly Ser 
1160 1165 1170 
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Glu Gly Thr Cys Ser Cys Ser Ser Lys Gly Phe Ser Thr Thr Cys 
H? 5 H80 Has 

Pro Ala His Val Asp Asp Leu Thr Pro Glu Gin Val Leu Asp Gly 
H90 1195 1200 

Asp Val Asn Glu Leu Met Asp Val Val Leu His His Val Pro Glu 
1205 1210 12 15 

Ala Lys Leu Val Glu Cys He Gly Gin Glu Leu He Phe Leu Leu 
1220 1225 1230 

Pro Asn Lys Asn Phe Lys His Arg Ala Tyr Ala Ser Leu Phe Arg 
1235 1240 1245 

Glu Leu Glu Glu Thr Leu Ala Asp Leu Gly Leu Ser Ser Phe Gly 
12 50 1255 1260 

He Ser Asp Thr Pro Leu Glu Glu He Phe Leu Lys Val Thr Glu 
1265 1270 1275 

Asp Ser Asp Ser Gly Pro Leu Phe Ala Gly Gly Ala Gin Gin Lys 
1280 1285 1290 

Arg Glu Asn Val Asn Pro Arg His Pro Cys Leu Gly Pro Arg Glu 
1295 130 0 1305 

Lys Ala Gly Gin Thr Pro Gin Asp Ser Asn Val Cys Ser Pro Gly 
1310 1315 1320 

Ala Pro Ala Ala His Pro Glu Gly Gin Pro Pro Pro Glu Pro Glu 
1325 1330 i33 5 

Cys Pro Gly Pro Gin Leu Asn Thr Gly Thr Gin Leu Val Leu Gin 
1340 1345 1350 

His Val Gin Ala Leu Leu Val Lys Arg Phe Gin His Thr He Arg 
1355 1360 1365 

Ser His Lys Asp Phe Leu Ala Gin He Val Leu Pro Ala Thr Phe 
1370 1375 1380 



Val Phe Leu Ala Leu Met Leu Ser He Val He Leu Pro Phe 
1385 1390 1395 
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Glu Tyr Pro Ala Leu Thr Leu His Pro Trp lie Tyr Gly Gin Gin 
1400 1405 1410 



Tyr Thr Phe Phe Ser Met Asp Glu Pro Gly Ser Glu Gin Phe Thr 
1415 1420 1425 



Val Leu Ala Asp Val Leu Leu Asn Lys Pro Gly Phe Gly Asn Arg 
1430 1435 1440 



Cys Leu Lys Glu Gly Trp Leu Pro Glu Tyr Pro Cys Gly Asn Ser 
1445 1450 1455 



Thr Pro Trp Lys Thr Pro Ser Val Ser Pro Asn He Thr Gin Leu 
1460 1465 1470 



Phe Gin Lys Gin Lys Trp Thr Gin Val Asn Pro Ser Pro Ser Cys 
■ 1475 1480 1485 



Arg Cys Ser Thr Arg Glu Lys Leu Thr Met Leu Pro Glu Cys Pro 
1490 1495 1500 



Glu Gly Ala Gly Gly Leu Pro Pro Pro Gin Arg Thr Gin Arg Ser 
1505 1510 1515 



Thr Glu He Leu Gin Asp Leu Thr Asp Arg Asn He Ser Asp Phe 
1520 1525 1530 



Leu Val Lys Thr Tyr Pro Ala Leu He Arg Ser Ser Leu Lys Ser 
1535 1540 1545 



Lys Phe Trp Val Asn Glu Gin Arg Tyr Gly Gly He Ser He Gly 
1550 1555 1560 



Gly Lys Leu Pro Val Val Pro He Thr Gly Glu Ala Leu Val Gly 
1565 1570 1575 



Phe Leu Ser Asp Leu Gly Arg He Met Asn Val Ser Gly Gly Pro 
1580 1585 1590 



He Thr Arg Glu Ala Ser Lys Glu lie Pro Asp Phe Leu Lys His 
1595 1600 1605 



Leu Glu Thr Glu Asp Asn He Lys Val Trp Phe Asn Asn Lys Gly 
1610 1615 1620 



A 



Trp His Ala Leu Val Ser Phe Leu Asn Val Ala His Asn Ala He 
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L6U LSU Pr ° LyS As P Ar 9 s « Pro Glu Glu Tyr Gly 

1640 i 645 * y 



1650 



He Thr val He Ser Gin Pro Leu Asn Leu Thr Lys Glu Gin Leu 
1655 xeeo 1665 

Ser Glu He Thr Val Leu Thr Thr Ser Val Asp Ala Val Val Ala 
1670 1675 1680 

He Cys val He Phe Ser Met Ser Phe Val Pro Ala Ser Phe Val 
1685 1690 1695 

LSU ^ 116 Gln G1U Arg Val Asn Ser His Leu Gin 

1700 1 7 °5 1710 

Phe lie ser Gly Val Ser Pro Thr Thr Tyr Trp Val Thr Asn Phe 
1715 1720 1725 

iSU «L ASP 116 Met ASn Tyr Ser val Ser Ala Gly Leu Val Val 
1730 1735 174Q 

Gly ™ Ph& 116 Gly PhS Gln Lys h Y* Ala T V r T hr Ser Pro Glu 
1745 1750 1755 

ASI1 ™ Pr ° Ma LSU Val Ala Leu Leu Leu L « u Tyr Gly Trp Ala 
1760 1765 17?0 

Val ?i" Pro Met Met Pro Ser Phe Leu Phe Asp Val Pro 

1775 1780 178S 

™ Ma ^ Val Wa Leu Ser Ala Asn l«u Phe He Gly 
1790 1795 1800 

He Asn Ser Ser Ala He Thr Phe He Leu Glu Leu Phe Asp Asn 
1805 1810 i 815 

Jff« LSU LSU ArS Phe Asn Ala Val Leu Arg Lys Leu Leu 

1820 1825 183Q 

He val Phe Pro His Phe Cys Leu Gly Arg Gly Leu He Asp Leu 
1835 le40 1845 

Ala Leu Ser Gln Ala Val Thr Asp Val Tyr Ala Arg Phe Gly Glu • 
1850 lass le60 
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Glu His Ser Ala Asn Pro Phe His Trp Asp Leu lie Gly Lys Asn 

Leu Phe Q Ala Met Val Val Glu Gly Val Val Tyr Phe Leu Leu Thr 

1885 1890 

Leu Leu Val Gin Arg His Phe Phe Leu Ser Gin Trp lie Ala Glu 
1895 1905 

Pro Thr Lys Glu Pro He Val Asp Glu Asp Asp Asp Val Ala Glu 
1910 1920 

Glu Arg_ Gin Arg He He Thr^ Gly Gly Asn Lys Thr_ Asp lie Leu 

Arg Leu His Glu Leu Thr Lys lie Tyr Leu Gly Thr Ser Ser Pro 

1945 1950 

Ala AS P Leu Cys Val^ Gly Val Arg Pro Gly Glu Cys Phe 

Gly JSo ° ly Val ASQ GlY Ala Gly Lys Thr Thr Thr Phe I*. 
1970 1375 1980 

Met Leu Thr Gly Asp Thr Thr Val Thr Ser Gly Asp Ala Thr Val 

1990 lg95 



Ma tllo ^ 5* ASa 11S Ser Glu Val His «» Asn 

2000 2005 2010 

Met Gly Tyr Cys Pro Gin Phe Asp Ala He Asp Glu Leu Leu Thr 

2020 2025 

Gly Arg Glu His Leu Tyr Leu Tyr Ala Arg Leu Arg Gly Val Pro 
2030 203 5 2040 

Ala Glu Glu lie Glu Lys Val Ala Asn Trp Ser lie Lys Ser Leu 

2050 2055 

2060 ^ ^ ^ Ala Thr Ser Gly 

2055 2070 

Gly A *» hys *** *** Leu s « ™r Ala lie Ala Leu He Gly Cys 
zv /z> 2080 



2085 
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Pro Pro Leu Val Leu Leu Asp Glu Pro Thr Thr Gly Met Asp Pro 
2090 2095 2100 



Gin Ala Arg Arg Met Leu Trp Asn Val lie Val Ser lie lie Arg 
2105 2110 2115 



Lys Gly Arg Ala Val Val Leu Thr Ser His Ser Met Glu Glu Cys 
2120 2125 2130 



Glu Ala Leu Cys Thr Arg Leu Ala lie Met Val Lys Gly Ala Phe 
2135 2140 2145 



Arg Cys Met Gly Thr lie Gin His Leu Lys Ser Lys Phe Gly Asp 
2150 2155 2160 



Gly Tyr He Val Thr Met Lys lie Lys Ser Pro Lys Asp Asp Leu 
2165 2170 2175 



Leu Pro Asp Leu Asn Pro Val Glu Gin Phe Phe Gin Gly Asn Phe 
2180 2185 2190 



Pro Gly Ser Val Gin Arg Glu Arg His Tyr Asn Met Leu Gin Phe 
2195 2200 2205 



Gin Val Ser Ser Ser Ser Leu Ala Arg He Phe Gin Leu Leu Leu 
2210 2215 2220 



Ser His Lys Asp Ser Leu Leu He Glu Glu Tyr Ser Val Thr Gin 
2225 2230 223*5 



Thr Thr Leu Asp Gin Val Phe Val Asn Phe Ala Lys Gin Gin Thr 
2240 2245 2250 



Glu Ser His Asp Leu Pro Leu His Pro Arg Ala Ala Gly Ala Ser 
2255 2260 2265 



Arg Gin Ala Gin Asp 
2270 



<210> 71 

<211> 560 

<212> PRT 

<213> Homo Sapiens 

<400> 71 



Met Val Pro His Ala He Leu Ala Arg Gly Arg Asp Val Cys Arg Arg 
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Asn Gly Leu Leu lie Leu Ser Val Leu Ser Val lie Val Gly Cys Leu 
20 25 30 



Leu Gly Phe Phe Leu Arg Thr Arg Arg Leu Ser Pro Gin Glu lie Ser 
35 40 45 



Tyr Phe Gin Phe Pro Gly Glu Leu Leu Met Arg Met Leu Lys Met Met 
50 55 60 



He Leu Pro Leu Val Val Ser Ser Leu Met Ser Gly Leu Ala Ser Leu 
65 70 75 80 



Asp Ala Lys Thr Ser Ser Arg Leu Gly Val Leu Thr Val Ala Tyr Tyr 
85 90 95 



Leu Trp Thr Thr Phe Met Ala Val He Val Gly He Phe Met Val Ser 
100 105 no 



He He His Pro Gly Ser Ala Ala Gin Lys Glu Thr Thr Glu Gin Ser 
115 120 125 



Gly Lys Pro He Met Ser Ser Ala Asp Ala Leu Leu Asp Leu He Arg 
130 135 140 



Asn Met Phe Pro Ala Asn Leu Val Glu Ala Thr Phe Lys Gin Tyr Ara 
145 150 155 1€ c 

Thr Lys Thr Thr Pro Val Val Lys Ser Pro Lys Val Ala Pro Glu Glu 
165 170 175 

Ala Pro Pro Arg Arg He Leu He Tyr Gly Val Gin Glu Glu Asn Gly 

180 185 190 



Ser His Val Gin Asn Phe Ala Leu Asp Leu Thr Pro Pro Pro Glu Val 
195 200 205 



Val Tyr Lys Ser Glu Pro Gly Thr Ser Asp Gly Met Asn Val Leu Gly 
210 215 220 



He Val Phe Phe Ser Ala Thr Met Gly He Met Leu Gly Arg Met Gly 
225 230 235 ~ 240 



Asp Ser Gly Ala Pro Leu Val Ser Phe Cys Gin Cys Leu Asn Glu Ser 
245 250 255 
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Val Met Lys He Val Ala Val Ala Val Trp Tyr Phe Pro Phe Gly lie 
260 265 270 



Val Phe Leu lie Ala Gly Lys He Leu Glu Met Asp Asp Pro Arg Ala 
275 280 



285 



Val Gly Lys Lys Leu Gly Phe Tyr Ser Val Thr Val Val Cys Gly Leu 
290 295 300 



Val Leu His Gly Leu Phe He Leu Pro Leu Leu Tyr Phe Phe He Thr 
305 ' 310 315 320 

Lys Lys Asn Pro He Val Phe He Arg Gly He Leu Gin Ala Leu Leu 
325 330 335 

He Ala Leu Ala Thr Ser Ser Ser Ser Ala Thr Leu Pro lie Thr Phe 
340 345 350 

hys Cys Leu Leu Glu Asn Asn His lie Asp Arg Arg He Ala Arg Phe 
355 260 365 



Val Leu Pro Val Gly Ala Thr He Asn Met Asp Gly Thr Ala Leu Tyr 



375 



380 



Glu Ala Val Ala Ala He Phe He Ala Gin Val Asn Asn Tyr Glu Leu 
385 390 395 400 

Asp Phe Gly Gin He He Thr He Ser lie Thr Ala Thr Ala Ala Ser 
405 410 415 

lie Gly Ala Ala Gly He Pro Gin Ala Gly Leu Val Thr Met Val He 
420 425 430 

Val Leu Thr Ser Val Gly Leu Pro Thr Asp Asp He Thr Leu He He 
43 5 440 445 

Ala Val Asp Trp Ala Leu Asp Arg Phe Arg Thr Met He Asn Val Leu 
450 455 460 



Gly Asp Ala Leu Ala Ala Gly He Met Ala His He Cys Arg Lys Asp 



470 



475 



480 



Phe Ala Arg Asp Thr Gly Thr Glu Lys Leu Leu Pro Cys Glu Thr Lvs 
485 490 



495 
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Pro Val Ser Leu Gin Glu He Val Ala Ala Gin Gin Asn Gly Cys Val 
500 505 sio 

Lys Ser Val Ala Glu Ala Ser Glu Leu Thr Leu Gly Pro Thr Cys Pro 
515 520 525 

His His Val Pro Val Gin Val Glu Arg Asp Glu Glu Leu Pro Ala Ala 
530 535 540 



Ser Leu Asn His Cys Thr He Gin He Ser Glu Leu Glu Thr Asn Val 

560 



545 5 50 555 



<210> 72 

<211> 840 

<212> PRT 

<213> Homo Sapiens 

<400> 72 

Met Val Thr Val Gly Asn Tyr Cys Glu Ala Glu Gly Pro Val Gly Pro 
1 5 10 15 

Ala Trp Met Gin Asp Gly Leu Ser Pro Cys Phe Phe Phe Thr Leu Val 
20 25 30 

Pro Ser Thr Arg Met Ala Leu Gly Thr Leu Ala Leu Val Leu Ala Leu 
35 40 45 

Pro Cys Arg Arg Arg Glu Arg Pro Ala Gly Ala Asp Ser Leu Ser Trp 

55 60 

Gly Ala Gly Pro Arg He Ser Pro Tyr Val Leu Gin Leu Leu Leu Ala 
65 70 75 80 

Thr Leu Gin Ala Ala Leu Pro Leu Ala Gly Leu Ala Gly Arg Val Gly 
85 90 95 

Thr Ala Arg Gly Ala Pro Leu Pro Ser Tyr Leu Leu Leu Ala Ser Val 
100 105 no 

Leu Glu Ser Leu Ala Gly Ala Cys Gly Leu Trp Leu Leu Val Val Glu 
115 120 i 2 5 

Arg Ser Gin Ala Arg Gin Arg Leu Ala Met Gly He Trp He Lvs Phe 
130 135 140 



Ser Pro Gly Leu Leu Leu Leu Trp Thr Val Ala Phe Ala Ala 
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145 150 155 160 



Glu Asn Leu Ala Leu Val Ser Trp Asn Ser Pro Gin Trp Trp Trp Ala 
165 170 175 



Arg Ala Asp Leu Gly Gin Gin Val Gin Phe Ser Leu Trp Val Leu Arg 
180 185 190 



Tyr Val Val Ser Gly Gly Leu Phe Val Leu Gly Leu Trp Ala Pro Gly 
195 200 205 



Leu Arg Pro Gin Ser Tyr Thr Leu Gin Val His Glu Glu Asp Gin Asp 
210 215 220 



Val Glu Arg Ser Gin Val Arg Ser Ala Ala Gin Gin Ser Thr Trp Arg 
225 230 235 240 



Asp Phe Gly Arg Lys Leu Arg Leu Leu Ser Gly Tyr Leu Trp Pro Arg 
245 250 255 



Gly Ser Pro Ala Leu Gin Leu Val Val Leu lie Cys Leu Gly Leu Met 
. 260 265 270 



Gly Leu Glu Arg Ala Leu Asn Val Leu Val Pro lie Phe Tyr Arg Asn 
275 280 285 



He Val Asn Leu Leu Thr Glu Lys Ala Pro Trp Asn Ser Leu Ala Trp 
290 295 300 



Thr Val Thr Ser Tyr Val Phe Leu Lys Phe Leu Gin Gly Gly Gly Thr 
305 310 315 * 320 



Gly Ser Thr Gly Phe Val Ser Asn Leu Arg Thr Phe Leu Trp He Arg 
' 325 330 335 



Val Gin Gin Phe Thr Ser Arg Arg Val Glu Leu Leu He Phe Ser His 
340 345 350 



Leu His Glu Leu Ser Leu Arg Trp His Leu Gly Arg Arg Thr Gly Glu 
355 360 365 



Val Leu Arg He Ala Asp Arg Gly Thr Ser Ser Val Thr Gly Leu Leu 
370 375 380 



Ser Tyr Leu Val Phe Asn Val tie Pro Thr Leu Ala Asp He He He 
385 390 395 " 400 
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Gly He He Tyr Phe Ser Met Phe Phe Asn Ala Trp Phe Gly Leu He 
405 410 415 



Val Phe Leu Cys Met Ser Leu Tyr Leu Thr Leu Thr He Val Val Thr 
420 425 430 



Glu Trp Arg Thr Lys Phe Arg Arg Ala Met Asn Thr Gin Glu Asn Ala 
435 440 445 



Thr Arg Ala Arg Ala Val Asp Ser Leu Leu Asn Phe Glu Thr Val Lys 
450 455 460 



Tyr Tyr Asn Ala Glu Ser Tyr Glu Val Glu Arg Tyr Arg Glu Ala He 
465 470 475 480 



He Lys Tyr Gin Gly Leu Glu Trp Lys Ser Ser Ala Ser Leu Val Leu 
485 490 495 



Leu Asn Gin Thr Gin Asn Leu Val lie Gly Leu Gly Leu Leu Ala Gly 
500 505 510 



Ser Leu Leu Cys Ala Tyr Phe Val Thr Glu Gin Lys Leu Gin Val Gly 
515 520 ' 525 



Asp Tyr Val Leu Phe Gly Thr Tyr He He Gin Leu Tyr Met Pro Leu 
530 535 540 



Asn Trp Phe Gly Thr Tyr Tyr Arg Met He Gin Thr Asn Phe He Asp 
545 550 555 560 



Met Glu Asn Met Phe Asp Leu Leu Lys Glu Glu Thr Glu Val Lys Asp 
565 570 575 



Leu Pro Gly Ala Gly Pro Leu Arg Phe Gin Lys Gly Arg He Glu Phe 
580 585 ~ 590 



Glu Asn Val His Phe Ser Tyr Ala Asp Gly Arg Glu Thr Leu Gin Asp 
595 600 605 



Val Ser Phe Thr Val Met Pro Gly Gin Thr Leu Ala Leu Val Gly Pro 
610 615 620 



Ser Gly Ala Gly Lys Ser Thr He Leu Arg Leu Leu Phe Arg Phe Tyr 
625 630 635 ~ 640 
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Asp lie Ser Ser Gly Cys lie Arg He Asp Gly Gin Asp He ser Gin 
645 650 655 

Val Thr Gin Ala Ser Leu Arg Ser His He Gly Val Val Pro Gin Asp 
66 <> 665 670 

Thr Val Leu Phe Asn Asp Thr He Ala Asp Asn He Arg Tyr Gly Arg 
675 680 685 

Val Thr Ala Gly Asn Asp Glu Val Glu Ala Ala Ala Gin Ala Ala Gly 
6 9° 695 700 

He His Asp Ala He Met Ala Phe Pro Glu Gly Tyr Arg Thr Gin Val 
705 710 715 720 

Gly Glu Arg Gly . Leu Lys Leu Ser Gly Gly Glu Lys Gin Arg Val Ala 
725 730 735 

He Ala Arg Thr He Leu Lys Ala Pro Gly He lie Leu Leu Asp Glu 
740 745 750 

Ala Thr Ser Ala Leu Asp Thr Ser Asn Glu Arg Ala He Gin Ala Ser 
7 55 760 765 

Leu Ala Lys Val Cys Ala Asn Arg Thr Thr He Val Val Ala His Arg 
77 ° 775 780 

Leu Ser Thr Val Val Asn Ala Asp Gin He Leu Val He Lys Asp Gly 
785 7 *° 795 800 

Cys He Val Glu Arg Gly Arg His Glu Ala Leu Leu Ser Arg Gly Gly 
805 810 815 

Val Tyr Ala Asp Met Trp Gin Leu Gin Gin Gly Gin Glu Glu Thr Ser 
82 ° 825 830 

Glu Asp Thr Lys Pro Gin Thr Met 
835 840 



<210> 73 

<211> 332 

<212> PRT 

<213> Homo Sapiens 

<400> 73 



Met Leu Leu Glu Thr Gin Asp Ala Leu Tyr Val Ala Leu Glu Leu Val 
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lie Ala Ala Leu Ser Val Ala Gly Asn Val Leu Val Cys Ala Ala Val 
20 25 30 



Gly Thr Ala Asn Thr Leu Gin Thr Pro Thr Asn Tyr Phe Leu Val Ser 
35 4Q 45 



lieu Ala Ala Ala Asp Val Ala val Gly Leu Phe Ala He Pro Phe Ala 
50 55 60 



lie Thr He Ser Leu Gly Phe Cys Thr Asp Phe Tyr Gly Cys Leu Phe 
65 70 75 ~ BO 



Leu Ala Cys Phe Val Leu Val Leu Thr Gin Ser Ser He Phe Ser Leu 
85 90 95 



Leu Ala Val Ala Val Asp Arg Tyr Leu Ala He Cys Val Pro Leu Arg 
100 105 " no 



Tyr Lys Ser Leu Val Thr Gly Thr Arg Ala Arg Gly Val He Ala Val 
115 120 125 



Leu Trp Val Leu Ala Phe Gly He Gly Leu Thr Pro Phe Leu Gly Trp 
130 135 140 



Asn Ser Lys Asp Ser Ala Thr Asn Asn Cys Thr Glu Pro Trp Asp Gly 
145 150 155 " iso 



Thr Thr Asn Glu Ser Cys Cys Leu Val Lys Cys Leu Phe Glu Asn Val 
165 170 175 



Val Pro Met Ser Tyr Met Val Tyr Phe Asn Phe Phe Gly Cys Val Leu 
180 185 190 



Pro Pro Leu Leu He Met Leu Val lie Tyr He Lys lie Phe Leu Val 
195 200 205 



Ala Cys Arg Gin Leu Gin Arg Thr Glu Leu Met Asp His Ser Arg Thr 
210 215 220 



Thr Leu Gin Arg Glu He His Ala Ala Lys Ser Leu Ala Met He Val 
225 230 235 240 



Gly He Phe Ala Leu Cys Trp Leu Pro Val His Ala Val Asn Cys Val 
245 250 255 
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Thr Leu Phe Gin Pro Ala Gin Gly Lys Asn Lys Pro Lys Trp Ala Met 
260 265 270 



Asn Met Ala lie Leu Leu Ser His Ala Asn Ser Val Val Asn Pro lie 
275 280 285 



Val Tyr Ala Tyr Arg Asn Arg Asp Phe Arg Tyr Thr Phe His Lys lie 
290 295 300 



He Ser Arg Tyr Leu Leu Cys Gin Ala Asp Val Lys Ser Gly Asn Gly 
305 310 315 ' 320 



Gin Ala Gly Val Gin Pro Ala Leu Gly Val Gly Leu 
325 330 



<210> 74 

<211> 180 

<212> PRT 

<213> Homo Sapiens 

<400> 74 

Met Gly Leu Gly Ala Arg Gly Ala Trp Ala Ala Leu Leu Leu Gly Thr 
15 io 15 



Leu Gin Val Leu Ala Leu Leu Gly Ala Ala His Glu Ser Ala Ala Met 
20 25 30 



Ala Glu Thr Leu Gin His Val Pro Ser Asp His Thr Asn Glu Thr Ser 
35 40 45 



Asn Ser Thr Val Lys Pro Pro Thr Ser Val Ala Ser Asp Ser Ser Asn 
50 55 60 



Thr Thr Val Thr Thr Met Lys Pro Thr Ala Ala Ser Asn Thr Thr Thr 
65 70 75 80 



Pro Gly Met Val Ser Thr Asn Met Thr Ser Thr Thr Leu Lys Ser Thr 
85 90 ^ 95 



Pro Lys Thr Thr Ser Val Ser Gin Asn Thr Ser Gin lie Ser Thr Ser 
100 105 no 



Thr Met Thr Val Thr His Asn Ser Ser Val Thr Ser Ala Ala Ser Ser 
115 120 125 
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Val Thr He Thr Thr Thr Met His Ser Glu Ala Lys Lys Gly Ser Lys 
130 135 140 

Phe Asp Thr Gly Ser Phe Val Gly Gly He Val Leu Thr Leu Gly Val 
145 150 ~ 15S 160 

Leu Ser He Leu Tyr He Gly Cys Lys Met Tyr Tyr Ser Arg Arg Gly 
165 170 175 

He Arg Tyr Arg 
180 



<210> 75 

<211> 240 

<212> PRT 

<213> Homo Sapiens 

<400> 75 

Met Ala Gin His Gly Ala Met Gly Ala Phe Arg Ala Leu Cys Gly Leu 
15 io 15 

Ala Leu Leu Cys Ala Leu Ser Leu Gly Gin Arg Pro Thr Gly Gly Pro 

25 30 

Gly Cys Gly Pro Gly Arg Leu Leu Leu Gly Thr Gly Thr Asp Ala Arg 
35 40 45 



Cys Cys Arg Val His Thr Thr Arg Cys Cys Arg Asp Tyr Pro Gly Glu 
50 55 60 



Glu Cys Cys Ser Glu Trp Asp Cys Met Cys Val Gin Pro Glu Phe His 

65 70 75 80 

Cys Gly Asp Pro Cys Cys Thr Thr Cys Arg His His Pro Cys Pro Pro 
85 90 95 

Gly Gin Gly Val Gin Ser Gin Gly Lys Phe Ser Phe Gly Phe Gin Cys 
100 105 no 



He Asp Cys Ala Ser Gly Thr Phe Ser Gly Gly His Glu Gly His Cys 
115 120 125 



Lys Pro Trp Thr Asp Cys Thr Gin Phe Gly Phe Leu Thr Val Phe Pro 
130 135 140 



Gly Asn Lys Thr His Asn Ala Val Cys Val Pro Gly Ser Pro Pro Ala 
145 150 155 160 
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Glu Pro Leu Gly Trp Leu Thr Val Val Leu Leu Ala Val Ala Ala Cys 
165 170 175 



Val Leu Leu Leu Thr Ser Ala Gin Leu Gly Leu His lie Trp Gin Leu 
180 185 190 



Arg Ser Gin Cys Met Trp Pro Arg Glu Thr Gin Leu Leu Leu Glu Val 
195 200 205 



Pro Pro Ser Thr Glu Asp Ala Arg Ser Cys Gin Phe Pro Glu Glu Glu 
210 215 220 



Arg Gly Glu Arg Ser Ala Glu Glu Lys Gly Arg Leu Gly Asp Leu Trp 
225 230 235 ' ' 240 



<210> 76 

<211> 514 

<212> PRT 

<213> Homo Sapiens 

<400> 76 

Met Gly Cys Asp Gly Arg Val Ser Gly Leu Leu Arg Arg Asn Leu Gin 
15 10 15 



Pro Thr Leu Thr Tyr Trp Ser Val Phe Phe Ser Phe Gly Leu Cys lie 
20 25 30 



Ala Phe Leu Gly Pro Thr Leu Leu Asp Leu Arg Cys Gin Thr His Ser 
35 40 " 45 



Ser Leu Pro Gin lie Ser Trp Val Phe Phe Ser Gin Gin Leu Cys Leu 
50 55 60 



Leu Leu Gly Ser Ala Leu Gly Gly Val Phe Lys Arg Thr Leu Ala Gin 
65 70 75 80 



Ser Leu Trp Ala Leu Phe Thr Ser Ser Leu Ala lie Ser Leu Val Phe 
85 $o 95 



Ala Val lie Pro Phe Cys Arg Asp Val Lys Val Leu Ala Ser Val Met 
100 105 no 



Ala Leu Ala Gly Leu Ala Met Gly Cys lie Asp Thr Val Ala Asn Met 
115 120 125 
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Gin Leu Val Arg Met Tyr Gin Lys Asp ser Ala Val Phe Leu Gin Val 
130 135 140 

Leu His Phe Phe Val Gly Phe Gly Ala Leu Leu Ser Pro Leu He Ala 
145 150 155 ieo 

Asp Pro Phe Leu Ser Glu Ala Asn Cys Leu Pro Ala Asn Ser Thr Ala 
165 170 175 

Asn Thr Thr Ser Arg Gly His Leu Phe His Val Ser Arg Val Leu Gly 
180 185 190 

Gin His His Val Asp Ala Lys Pro Trp Ser Asn Gin Thr Phe Pro Gly 
195 200 205 

Leu Thr Pro Lys Asp Gly Ala Gly Thr Arg Val Ser Tyr Ala Phe Trp 
210 215 220 

He Met Ala Leu He Asp Leu Pro Val Pro Met Ala Val Leu Met Leu 
225 230 235 240 

Leu Ser Lys Glu Arg Leu Leu Thr Cys Cys Pro Gin Arg Arg Pro Leu 
245 250 255 

Leu Leu Ser Ala Asp Glu Leu Ala Leu Glu Thr Gin Pro Pro Glu Lys 
260 265 270 

Glu Asp Ala Ser Ser Leu Pro Pro Lys Phe Gin Ser His Leu Gly His 
275 280 285 

Glu Asp Leu Phe Ser Cys Cys Gin Arg Lys Asn Leu Arg Gly Ala Pro 
250 295 300 

Tyr Ser Phe Phe Ala He His He Thr Gly Ala Leu Val Leu Phe Met 
305 . 310 315 3 20 

Thr Asp Gly Leu Thr Gly Ala Tyr Ser Ala Phe Val Tyr Ser Tyr Ala 
325 330 335 

Val Glu Lys Pro Leu Ser Val Gly His Lys Val Ala Gly Tyr Leu Pro 
340 345 350 

Ser Leu Phe Trp Gly Phe lie Thr Leu Gly Arg Leu Leu Ser He Pro 
355 360 365 



Ser Ser Arg Met Lys Pro Ala Thr Met Val Phe He Asn Val Val 
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Gly Val Val Val Thr Phe Leu Val Leu Leu lie Phe Ser Tyr Asn Val 
385 390 395 400 

Val Phe Leu Phe Val Gly Thr Ala Ser Leu Gly Leu Phe Leu Ser Ser 
405 4i 0 415 

Thr Phe Pro Ser Met Leu Ala Tyr Thr Glu Asp Ser Leu Gin Tyr Lys 
420 425 430 

Gly Cys Ala Thr Thr Val Leu Val Thr Gly Ala Gly Val Gly Glu Met 
435 440 445 

Val Leu Gin Met Leu Val Gly Ser lie Phe Gin Ala Gin Gly Ser Tvr 
450 455 46O 



Ser Phe Leu Val Cys Gly Val He Phe Gly Cys Leu Ala Phe Thr Phe 
465 470 475 480 



Tyr He Leu Leu Leu Phe Phe His Arg Met His Pro Gly Leu Pro Ser 
485 490 495 

Val Pro Thr Gin Asp Arg Ser He Gly Met Glu Asn Ser Glu Cys Tyr 
500 505 510 

Gin Arg 



<210> 77 

<211> 1181 

<212> PRT 

<213> Homo Sapiens 

<400> 77 

Met Gly Pro Glu Arg Thr Gly Ala Ala Pro Leu Pro Leu Leu Leu Val 
15 io 15 

Leu Ala Leu Ser Gin Gly He Leu Asn Cys Cys Leu Ala Tyr Asn Val 
20 25 30 

Gly Leu Pro Glu Ala Lys He Phe Ser Gly Pro Ser Ser Glu Gin Phe 
35 40 45 



Gly Tyr Ala Val Gin Gin Phe lie Asn Pro Lys Gly Asn Trp Leu Leu 
50 55 so 
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Val Gly Ser Pro Trp ser Gly Phe Pro Glu Asn Arg Met Gly Asp Val 
65 70 75 * 80 



Tyr Lys Cys Pro Val Asp Leu Ser Thr Ala Thr Cys Glu Lys Leu Asn 
85 90 95 



Leu Gin Thr Ser Thr Ser He Pro Asn Val Thr Glu Met Lys Thr Asn 
100 105 no 



Met Ser Leu Gly Leu He Leu Thr Arg Asn Met Gly Thr Gly Gly Phe 
115 120 125 



Leu Thr Cys Gly Pro Leu Trp Ala Gin Gin Cys Gly Asn Gin Tyr Tvr 
130 135 140 



Thr Thr Gly Val Cys Ser Asp He Ser Pro Asp Phe Gin Leu Ser Ala 
!45 150 155 i 6 o 



Ser Phe Ser Pro Ala Thr Gin Pro Cys Pro Ser Leu He Asp Val Val 
165 170 175 



Val Val Cys Asp Glu Ser Asn Ser lie Tyr Pro Trp Asp Ala Val Lys 
180 185 190 



Asn Phe Leu Glu Lys Phe Val Gin Gly Leu Asp He Gly Pro Thr Lys 
195 200 205 



Thr Gin Val Gly Leu He Gin Tyr Ala Asn Asn Pro Arg Val Val Phe 
210 215 220 



Asn Leu Asn Thr Tyr Lys Thr Lys Glu Glu Met He Val Ala Thr Ser 
225 230 235 240 



Gin Thr Ser Gin Tyr Gly Gly Asp Leu Thr Asn Thr Phe Gly Ala lie 
245 250 255 



Gin Tyr Ala Arg Lys Tyr Ala Tyr Ser Ala Ala Ser Gly Gly Arg Arg 
260 265 270 



Ser Ala Thr Lys Val Met Val Val Val Thr Asp Gly Glu Ser His Asp 
275 280 285 



Gly Ser Met Leu Lys Ala Val He Asp Gin Cys Asn His Asp Asn He 
290 295 300 
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Leu Arg Phe Gly lie Ala Val Leu Gly Tyr Leu Asn Arg Asn Ala Leu 
305 310 315 3 2 o 

Asp Thr Lys Asn Leu lie Lys Glu lie Lys A la He Ala Ser He Pro 
325 330 335 

Thr Glu Arg Tyr Phe Phe Asn Val Ser Asp Glu Ala Ala Leu Leu Glu 
340 345 350 

Lys Ala Gly Thr Leu Gly Glu Gin He Phe Ser He Glu Gly Thr Val 
355 360 365 

Gin Gly Gly Asp Asn Phe Gin Met Glu Met Ser Gin Val Gly Phe Ser 
370 375 360 

Ala Asp Tyr Ser Ser Gin Asn Asp He Leu Met Leu Gly Ala Val Gly 
385 390 395 400 

Ala Phe Gly Trp Ser Gly Thr He Val Gin Lys Thr Ser His Gly His 
405 410 41S 

Leu He Phe Pro Lys Gin Ala Phe Asp Gin He Leu Gin Asp Arg Asn 
420 425 43Q 

His Ser Ser Tyr Leu Gly Tyr Ser Val Ala Ala He Ser Thr Gly Glu 
435 440 445 

Ser Thr His Phe Val Ala Gly Ala Pro Arg Ala Asn Tyr Thr Gly Gin 
450 455 460 

lie Val Leu Tyr Ser Val Asn Glu Asn Gly Asn He Thr Val He Gin 
465 470 475 480 

Ala His Arg Gly Asp Gin He Gly Ser Tyr Phe Gly Ser Val Leu Cys 
485 490 495 

Ser Val Asp Val Asp Lys Asp Thr He Thr Asp Val Leu Leu Val Gly 
500 505 510 

Ala Pro Met Tyr Met Ser Asp Leu Lys Lys Glu Glu Gly Arg Val Tyr 
515 520 525 

Leu Phe Thr He Lys Lys Gly lie Leu Gly Gin His Gin Phe Leu Glu 
530 535 540 



Gly Pro Glu Gly H e Glu Asn Thr Arg Phe Gly Ser Ala He Ala Ala 
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545 550 555 560 



Leu Ser Asp He Asn Met Asp Gly Phe Asn Asp Val He Val Gly Ser 
565 570 575 



Pro Leu Glu Asn Gin Asn Ser Gly Ala Val Tyr He Tyr Asn Gly His 
580 585 590 



Gin Gly Thr He Arg Thr Lys Tyr Ser Gin Lys He Leu Gly Ser Asp 
595 600 605 



Gly Ala Phe Arg Ser His Leu Gin Tyr Phe Gly Arg Ser Leu Asp Gly 
610 615 62 0 



Tyr Gly Asp Leu Asn Gly Asp Ser lie Thr Asp Val Ser He Gly Ala 
625 630 635 ^ 640 



Phe Gly Gin Val Val Gin Leu Trp Ser Gin Ser lie Ala Asp Val Ala 
645 650 * 655 



He Glu Ala Ser Phe Thr Pro Glu Lys He Thr Leu Val Asn Lys Asn 
660 665 670 



Ala Gin He He Leu Lys Leu Cys Phe Ser Ala Lys Phe Arg Pro Thr 
675 680 685 



Lys Gin Asn Asn Gin Val Ala lie Val Tyr Asn He Thr Leu Asp Ala 
690 695 700 



Asp Gly Phe Ser Ser Arg Val Thr Ser Arg Gly Leu Phe Lys Glu Asn 
705 710 715 * 720 



Asn Glu Arg Cys Leu Gin Lys Asn Met Val Val Asn Gin Ala Gin Ser 
725 730 735 



Cys Pro Glu His He He Tyr He Gin Glu Pro Ser Asp Val Val Asn 
740 745 750 



Ser Leu Asp Leu Arg Val Asp He Ser Leu Glu Asn Pro Gly Thr Ser 
755 760 765 



Pro Ala Leu Glu Ala Tyr Ser Glu Thr Ala Lys Val Phe Ser He Pro 
770 775 780 



Phe His Lys Asp Cys Gly Glu Asp Gly Leu Cys He Ser Asp Leu Val 
785 790 795 '~ 800 
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Leu Asp Val Arg Gin lie Pro Ala Ala Gin Glu Gin Pro Phe He Val 
805 810 815 



Ser Asn Gin Asn Lys Arg Leu Thr Phe Ser Val Thr Leu Lys Asn Lys 
820 825 830 



Arg Glu Ser Ala Tyr Asn Thr Gly He Val Val Asp Phe Ser Glu Asn 
835 840 845 



Leu Phe Phe Ala Ser Phe Ser Leu Pro Val Asp Gly Thr Glu Val Thr 
850 855 860 



Cys Gin Val Ala Ala Ser Gin Lys Ser Val Ala Cys Asp Val Gly Tyr 
865 870 875 " 880 



Pro Ala Leu Lys Arg Glu Gin Gin Val Thr Phe Thr He Asn Phe Asp 
885 890 895 



Phe Asn Leu Gin Asn Leu Gin Asn Gin Ala Ser Leu Ser Phe Gin Ala 
900 905 910 



Leu Ser Glu Ser Gin Glu Glu Asn Lys Ala Asp Asn Leu Val Asn Leu 
915 920 925 



Lys He Pro Leu Leu Tyr Asp Ala Glu He His Leu Thr Arg Ser Thr 
930 935 940 



Asn He Asn Phe Tyr Glu He Ser Ser Asp Gly Asn Val Pro Ser He 
945 950 955 960 



Val His Ser Phe Glu Asp Val Gly Pro Lys Phe He Phe Ser Leu Lys 
965 970 975 



Val Thr Thr Gly Ser Val Pro Val Ser Met Ala Thr Val He He His 
980 985 990 



He Pro Gin Tyr Thr Lys Glu Lys Asn Pro Leu Met Tyr Leu Thr Gly 
995 1000 1005 



Val Gin Thr Asp Lys Ala Gly Asp He Ser Cys Asn Ala Asp He 
1010 1015 1020 



Asn Pro Leu Lys He Gly Gin Thr Ser Ser Ser Val Ser Phe Lys 
1025 1030 1035 
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Ser Glu Asn Phe Arg His Thr Lys Glu Leu Asn Cys Arg Thr Ala 
1040 1045 1050 



Ser Cys Ser Asn Val Thr Cys Trp Leu Lys Asp Val His Met Lys 
1055 1060 1065 



Gly Glu Tyr Phe Val Asn Val Thr Thr Arg lie Trp Asn Gly Thr 
1070 1075 1080 



Phe Ala Ser Ser Thr Phe Gin Thr Val Gin Leu Thr Ala Ala Ala 
1085 1090 1095 



Glu He Asn Thr Tyr Asn Pro Glu He Tyr Val He Glu Asp Asn 
1100 1105 1110 



Thr Val Thr He Pro Leu Met He Met Lys Pro Asp Glu Lys Ala 
1115 1120 1125 



Glu Val Pro Thr Gly Val He He Gly Ser lie lie Ala Gly lie 
1130 1135 1140 



Leu Leu Leu Leu Ala Leu Val Ala He Leu Trp Lys Leu Gly Phe 
1145 1150 1155 



Phe Lys Arg Lys Tyr Glu Lys Met Thr Lys Asn Pro Asp Glu He 
1160 H65 H70 



Asp Glu Thr Thr Glu Leu Ser Ser 
1175 1180 



<210> 78 

<211> 332 

<212> PRT 

<213> Homo Sapiens 

<400> 78 

Met Tyr Arg Pro Arg Ala Arg Ala Ala Pro Glu Gly Arg Val Arg Gly 
1 5 10 15 



Cys Ala Val Pro Ser Thr Val Leu Leu Leu Leu Ala Tyr Leu Ala Tyr 
20 25 30 



Leu Ala Leu Gly Thr Gly Val Phe Trp Thr Leu Glu Gly Arg Ala Ala 
35 40 45 



Gin Asp Ser Ser Arg Ser Phe Gin Arg Asp Lys Trp Glu Leu Leu Gin 
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Asn Phe Thr Cys Leu Asp Arg Pro Ala Leu Asp Ser Leu lie Arg Asp 
65 70 75 80 

Val Val Gin Ala Tyr Lys Abu Gly Ala Ser Leu Leu Ser Asn Thr Thr 

85 90 95 



Ser Met Gly Arg Trp Glu Leu Val Gly Ser Phe Phe Phe Ser Val Ser 
100 105 no 



Thr lie Thr Thr He Gly Tyr Gly Asn Leu Ser Pro Asn Thr Met Ala 
115 120 



125 



Ala Arg Leu Phe Cys He Phe Phe Ala Leu Val Gly He Pro Leu Asn 
I 30 135 140 



Leu Val Val Leu Asn Arg Leu Gly His Leu Met Gin Gin Gly Val Asn 
145 I 50 155 leo 

His Trp Ala Ser Arg Leu Gly Gly Thr Tip Gin Asp Pro Asp Lys Ala 
165 i7 0 175 

Arg Trp Leu Ala Gly Ser Gly Ala Leu Leu Ser Gly Leu Leu Leu Phe 
180 185 190 

Leu Leu- Leu Pro Pro Leu Leu Phe Ser His Met Glu Gly Trp Ser Tyr 
I 95 200 205 

Thr Glu Gly Phe Tyr Phe Ala Phe lie Thr Leu Ser Thr Val Gly Phe 
210 215 220 



Gly Asp Tyr Val He Gly Met Asn Pro Ser Gin Arg Tyr Pro Leu Tro 
225 230 235 



240 



Tyr Lys Asn Met Val Ser Leu Trp He Leu Phe Gly Met Ala Trp Leu 
245 250 



255 



Ala Leu He He Lys Leu He Leu Ser Gin Leu Glu Thr Pro Gly Arq 
260 265 270 

Val Cys Ser Cys Cys His His Ser Ser Lys Glu Asp Phe Lys Ser Gin 
275 280 285 



Ser Trp Arg Gin Gly Pro Asp Arg Glu Pro Glu Ser His Ser Pro Gin 
29 0 295 300 
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Gin Gly Cys Tyr Pro Glu Gly Pro Met Gly lie lie Gin His Leu Glu 
305 



310 315 320 



Pro Ser Ala His Ala Ala Gly Cys Gly Lys Asp Ser 
325 330 



<210> 79 

<211> 328 

<212> PRT 

<213> Homo Sapiens 

<400> 79 



Met Glu Trp Asp Asn Gly Thr Gly Gin Ala Leu Gly Leu Pro Pro Thr 
c 10 15 



Thr Cys Val Tyr Arg Glu Asn Phe Lys Gin Leu Leu Leu Pro Pro Val 
20 25 30 

Tyr Ser Ala Val Leu Ala Ala Gly Leu Pro Leu Asn He Cys Val He 
35 40 45 

Thr Gin He Cys Thr Ser Arg Arg Ala Leu Thr Arg Thr Ala Val Tyr 
50 * 55 60 

Thr Leu Asn Leu Ala Leu Ala Asp Leu Leu Tyr Ala Cys Ser Leu Pro 
65 70 75 80 

Leu Leu He Tyr Asn Tyr Ala Gin Gly Asp His Trp Pro Phe Gly Asp 
85 90 95 

Phe Ala cys Arg Leu Val Arg Phe Leu Phe Tyr Ala Asn Leu His Gly 
100 105 HO 

Ser lie Leu Phe Leu Thr Cys. He Ser Phe Gin Arg Tyr Leu Gly He 
115 120 125 

Cys His Pro Leu Ala Pro Trp His Lys Arg Gly Gly Arg Arg Ala Ala 
130 135 140 

Trp Leu Val Cys Val Ala Val Trp Leu Ala Val Thr Thr Gin Cys Leu 
145 150 155 160 

Pro Thr Ala He Phe Ala Ala Thr Gly He Gin Arg Asn Arg Thr Val 
165 I 70 175 
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Cys Tyr Asp Leu Ser Pro Pro Ala Leu Ala Thr His Tyr Met Pro Tyr 
180 185 190 



Gly Met Ala Leu Thr Val He Gly Phe Leu Leu Pro Phe Ala Ala Leu 
195 200 205 



Leu Ala Cys Tyr Cys Leu Leu Ala Cys Arg Leu Cys Arg Gin Asp Gly 
210 215 220 



Pro Ala Glu Pro Val Ala Gin Glu Arg Arg Gly Lys Ala Ala Arg Met 
225 230 235 240 



Ala Val Val Val Ala Ala Ala Phe Ala lie Ser Phe Leu Pro Phe His 
245 250 255 



lie Thr Lys Thr Ala Tyr Leu Ala Val Arg Ser Thr Pro Gly Val Pro 
260 265 270 



Cys Thr Val Leu Glu Ala Phe Ala Ala Ala Tyr Lys Gly Thr Arg Pro 
275 280 285 



Phe Ala Ser Ala Asn Ser Val Leu Asp Pro He Leu Phe Tyr Phe Thr 
290 295 300 



Gin Lys Lys Phe Arg Arg Arg Pro His Glu Leu Leu Gin Lys Leu Thr 
305 310 315 320 



Ala Lys Trp Gin Arg Gin Gly Arg 
325 



<211> 581 
<212> PRT 
<213> Homo Sapiens 

<400> 80 

Met Gin Arg Pro Gly Pro Arg Leu Trp Leu Val Leu Gin Val Met Gly 
1 5 10 is 



Ser Cys Ala Ala He Ser Ser Met Asp Met Glu Arg Pro Gly Asp Gly 
20 25 30 



Lys Cys Gin Pro He Glu He Pro Met Cys Lys Asp He Gly Tyr Asn 
35 40 45 



Met Thr Arg Met Pro Asn Leu Met Gly His Glu Asn Gin Arg Glu Ala 
50 55 60 
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Ala lie G m Leu His Glu P he Ala Pro Leu Val Glu ^ Qly His 

70 75 80 

Sly His Leu Arg Phe Phe Leu Cys Ser Leu Tyr Ala Pro Met Cys Thr 
85 9° 95 

Glu Gin Val Ser Thr Pro lie Pro Ala Cys Arg Val Met Cy S Glu Gin 
100 105 110 

Ala Arg Leu Lys Cys Ser Pro He Met Glu Gin Phe Asn Phe Lys Trp 

120 125 

Pro Asp Ser Leu Asp Cys Arg Lys Leu Pro Asn Lys Asn Asp Pro Asn 

135 140 

Tyr Leu Cys Met Glu Ala Pro Asn Asn Gly Ser Asp Glu Pro Thr Arg 

155 160 

Gly ser Gly Leu Phe Pro Pro Leu Phe Arg Pro Gin Arg Pro His Ser 
165 «0 175 

Ala Gin Glu His Pro Leu Lys Asp Gly Gly Pro Gly Arg Gly Gly cys 

185 19Q 

Asp Asn Pro Gly Lys Phe His His Val Glu Lys Ser Ala Ser Cys Ala 
^ 200 205 

Pro Leu cys Thr Pro Gly Val Asp Val Tyr Trp ser Arg «« A6p Lys 



220 



Arg Phe Ala Val Val Trp Leu Ala n P T , , T 

225 oon Trp Ala Val Leu c y s Phe Phe 

zju 235 



240 



Ser ser Ala Phe Thr Val Leu Thr Phe Leu lie Asp Pro Ala Arg Phe 
245 250 255 

Arg Tyr Pro Glu Arg Pro lie lie Phe Leu Ser Met Cys Tyr Cys Val 

265 270 

Tyr ser Val Gly Tyr Leu lie Arg Leu Phe Ala Gly Ala Glu Ser He 

280 285 

Ala Cys Asp Arg Asp Ser Gly Gin Leu Tyr Val Ue Gin Glu Gly Leu 

IBS 300 
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Glu Ser Thr Gly Cys Thr Leu Val Phe Lau Val Leu Tyr Tyr Phe Gly 

310 315 320 



Met Ala Ser Ser Leu Trp Trp Val Val Leu Thr Leu Thr Trp Phe 
325 



330 335 



Leu 



Ala Ala Gly Lys Lyg Trp Gly His Glu Ala He Glu Ala Asn Ser Ser 
340 345 350 

Tyr Phe His Leu Ala Ala Trp Ala He Pro Ala Val Lys Thr tie Leu 
55 36 <> 365 

He Leu Val Met Arg Arg Val Ala Gly Asp Glu Leu Thr Gly Val Cys 

375 380 

Tyr Val Gly Ser Met Asp Val Asn Ala Leu Thr Gly Phe Val Leu He 

390 395 400 



Pro Leu Ala Cys Tyr Leu Val He Gly Thr Ser Phe He Leu Ser Gly 
405 410 415 

Phe val Ala Leu Phe His He Arg Arg Val Met Lys Thr Gly Gly Glu 

425 430 

Asn Thr Asp Lys Leu Glu Lys Leu Met Val Arg He Gly Leu Phe Ser 
45 440 445 

Val Leu Tyr Thr Val Pro Ala Thr Cys Val He Ala Cys Tyr Phe Tyr 



4 55 460 



Glu Arg Leu Asn Met Asp Tyr Trp Lys He Leu Ala Ala Gin His Lys 



475 



480 



Cys Lys Met Asn Asn Gin Thr Lys Thr Leu Asp Cys Leu Met Ala Ala 
485 490 495 

Ser He Pro Ala Val Glu He Phe Met Val Lys He Phe Met Leu Leu 



Val Val Gly He Thr Ser Gly Met Trp He Trp Thr Ser Lys Thr Leu 

520 525 

Gin Ser Trp Gin Gin Val Cys Ser Arg Arg Leu Lys Lys Lys Ser Arg 



540 
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Arg Lys Pro Ala Ser Val lie Thr Ser Gly Gly He Tyr Lys Lys Ala 

Gin His Pro Gin Lys Thr His His Gly Lys Tyr Glu lie Pro Ala Gin 
565 570 



Ser Pro Thr Cys Val 
580 



<210> 81 

<211> 539 

<212> PRT 

<213> Homo sapiens 

<400> 81 

Met Val Pro Gly Ala Arg Gly Gly Gly Ala Leu Ala Arg Ala Ala Gly 
,5 10 15 



Arg Gly Leu Leu Ala Leu Leu Leu Ala Val Ser Ala Pro Leu Arg Leu 
2 0 25 

Gin Ala Glu Glu Leu Gly Asp Gly Cys Gly His Leu Val Thr Tyr Gin 
35 40 45 

Asp Ser Gly Thr Met Thr Ser Lys Asn Tyr Pro Gly Thr Tyr Pro Asn 



50 55 60 



His Thr Val Cys Glu Lys Thr He Thr Val Pro Lys Gly Lys Arg Leu 



65 



70 



lie Leu Arg Leu Gly Asp Leu Asp He Glu Ser Gin Thr Cys Ala Ser 
85 90 

Asp Tyr Leu Leu Phe Thr Ser Ser Ser Asp Gin Tyr Gly Pro Tyr Cys 

105 i - LU 



100 



Gly Ser Met Thr Val Pro Lys Glu Leu Leu Leu Asn Thr Ser Glu Val 
115 120 "5 



Thr Val Arg Phe Glu Ser Gly Ser His lie Ser Gly Arg Gly Phe Leu 
130 



135 140 



Leu Thr Tyr Ala Ser Ser Asp His Pro Asp Leu lie Thr Cys Leu Glu 
145 150 155 

Arg Ala Ser His Tyr Leu Lys Thr Glu Tyr Ser Lys Phe Cys Pro Ala 
165 170 
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Gly Cys Arg Asp Val Ala Gly Asp He Ser Gly Asn Met Val Asp Gly 
180 185 3.90 

Tyr Arg Asp Thr Ser Leu Leu Cys Lys Ala Ala He His Ala Gly lie 
195 200 205 

lie Ala Asp Glu Leu Gly Gly Gin He Ser Val Leu Gin Arg Lys Gly 
lie Ser Arg Tyr Glu Gly He Leu Ala Asn Gly Val Leu Ser Arg Asp 



235 



240 



Ser Ser 
270 



Gly Ser Leu Ser Asp Lys Arg Phe Leu Phe Thr Ser Asn Gly Cys Ser 
245 250 255 

Arg ser Leu Ser Phe Glu Pro Asp Gly Gin lie Arg Ala Ser 
260 265 270 

Trp Gin Ser Val Asn Glu Ser Gly Asp Gin Val His Trp Ser Pro Gly 
275 280 285 

Gin Ala Arg Leu Gin Asp Gin Gly Pro Ser Trp Ala Ser Gly Asp Ser 

295 300 

Ser Asn Asn His Lys Pro Arg Glu Trp Leu Glu He Asp Leu Gly Glu 
310 315 320 



Asn 
335 



Lys Lys Lys He Thr Gly He Arg Thr Thr Gly Ser Thr Gin Ser 
325 33° 335 

Phe Asn Phe Tyr Val Lys Ser Phe Val Met Asn Phe Lys Asn Asn Asn 



340 345 



350 



Ser Lys Trp Lys Thr Tyr Lys Gly He Val Asn Asn Glu Glu Lys Val 
355 360 3 65 

Phe Gin Gly Asn Ser Asn Phe Arg Asp Pro Val Gin Asn Asn Phe He 
370 375 380 

Pro Pro He Val Ala Arg Tyr Val Arg Val Val Pro Gin Thr Trp His 
390 395 * 4Q0 

Gin Arg He Ala Leu Lys Val Glu Leu He Gly Cys Gin He Thr Gin 
405 410 415 
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Gly Asn Asp Ser Leu Val Trp Arg Lys Thr Ser Gin Ser Thr Ser Val 
420 425 430 



Ser Thr Lys Lys Glu Asp Glu Thr lie Thr Arg Pro He Pro Ser Glu 

435 440 4 45 

Glu Thr Ser Thr Gly He Asn He Thr Thr Val Ala He Pro Leu Val 

450 455 460 



Leu Leu Val Val Leu Val Phe Ala Gly Met Gly He Phe Ala Ala Phe 
465 470 475 480 

Arg Lys Lys Lys Lys Lys Gly Ser Pro Tyr Gly Ser Ala Glu Ala Gin 
485 490 495 



Lys Thr Asp Cys Trp Lys Gin He Lys Tyr Pro Phe Ala Arg His Gin 
500 505 510 



Ser Ala Glu Phe Thr He Ser Tyr Asp Asn Glu Lys Glu Met Thr Gin 
515 520 525 



Lys Leu Asp Leu He Thr Ser Asp Met Ala Gly 
530 535 



<210> 82 

<211> 539 

<212> PRT 

<213> Homo sapiens 

<400> 82 

Met Val Pro Gly Ala Arg Gly Gly Gly Ala Leu Ala Arg Ala Ala Gly 
15 10 15 



Arg Gly Leu Leu Ala Leu Leu Leu Ala Val Ser Ala Pro Leu Arg Leu 
20 25 3 0 



Gin Ala Glu Glu Leu Gly Asp Gly Cys Gly His Leu Val Thr Tyr Gin 
35 40 45 



Asp Ser Gly Thr Met Thr Ser Lys Asn Tyr Pro Gly Thr Tyr Pro Asn 
50 55 60 



His Thr Val Cys Glu Lys Thr He Thr Val Pro Lys Gly Lys Arg Leu 
65 70 75 80 

He Leu Arg Leu Gly Asp Leu Asp He Glu Ser Gin Thr Cys Ala Ser 
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95 



Asp Tyr Leu Leu Phe Thr Ser Ser Ser Asp Gin Tyr Gly Pro Tyr Cys 
100 105 HO 



Gly Ser Met Thr Val Pro Lys Glu Leu Leu Leu Asn Thr Ser Glu Val 
115 120 125 

Thr Val Arg Phe Glu Ser Gly Ser His lie Ser Gly Arg Gly Phe Leu 
130 135 140 

Leu Thr Tyr Ala Ser Ser Asp His Pro Asp Leu He Thr Cys Leu Glu 
145 " 150 155 160 



Arg Ala Ser His Tyr Leu Lys Thr Glu Tyr Ser Lys Phe Cys Pro Ala 
165 170 175 



Gly Cys Arg Asp Val Ala Gly Asp He Ser Gly Asn Met Val Asp Gly 
180 185 190 



Tyr Arg Asp Thr Ser Leu Leu Cys Lys Ala Ala He His Ala Gly lie 
195 200 205 



He Ala Asp Glu Leu Gly Gly Gin He Ser Val Leu Gin Arg Lys Gly 
210 215 220 



He Ser Arg Tyr Glu Gly He Leu Ala Asn Gly Val Leu Ser Arg Asp 
225 " * 230 235 240 



Gly Ser Leu Ser Asp Lys Arg Phe Leu Phe Thr Ser Asn Gly Cys Ser 
245 250 255 



Arg Ser Leu Ser Phe Glu Pro Asp Gly Gin He Arg Ala Ser Ser Ser 
260 265 270 



Trp Gin Ser Val Asn Glu Ser Gly Asp Gin Val His Trp Ser Pro Gly 
275 280 285 



Gin Ala Arg Leu Gin Asp Gin Gly Pro Ser Trp Ala Ser Gly Asp Ser 
290 295 300 



Ser Asn Asn His Lys Pro Arg Glu Trp Leu Glu He Asp Leu Gly Glu 
305 310 315 320 



Lys Lys Lys He Thr Gly He Arg Thr Thr Gly Ser Thr Gin Ser Asn 
325 330 335 
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Phe Asn Phe Tyr Val Lys Ser Phe Val Met Asn Phe Lys Asn Asn Asn 
340 345 350 

Ser Lys Trp Lys Thr Tyr Lys Gly He Val Asn Asn Glu Glu Lys Val 
355 360 365 

Phe Gin Gly Asn Ser Asn Phe Arg Asp Pro Val Gin Asn Asn Phe He 
370 375 380 

Pro Pro He Val Ala Arg Tyr Val Arg Val Val Pro Gin Thr Trp His 
385 390 395 400 

Gin Arg He Ala Leu Lys Val Glu Leu He Gly Cys Gin He Thr Gin 
405 410 415 

Gly Asn Asp Ser Leu Val Trp Arg Lys Thr Ser Gin Ser Thr Ser Val 
420 425 430 

Ser Thr Lys Lys Glu Asp Glu Thr He Thr Arg Pro He Pro Ser Glu 
435 440 445 

Glu Thr Ser Thr Gly He Asn lie Thr Thr Val Ala He Pro Leu Val 
450 455 460 

Leu Leu Val Val Leu Val Phe Ala Gly Met Gly He Phe Ala Ala Phe 
465 470 475 480 

Arg Lys Lys Lys Lys Lys Gly Ser Pro Tyr Gly Ser Ala Glu Ala Gin 
485 490 495 

Lys Thr Asp Cys Trp Lys Gin He Lys Tyr Pro Phe Ala Arg His Gin 
500 505 510 

Ser Ala Glu Phe Thr He Ser Tyr Asp Asn Glu Lys Glu Met Thr Gin 
515 520 525 



Lys Leu Asp Leu He Thr Ser Asp Met Ala Gly 
530 535 



<210> 83 

<2H> 237 

<212> PRT 

<213> Homo Sapiens 



<400> 83 
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Met Ala Gly Val Ser Ala Cys He Lys Tyr Ser Met Phe Thr Phe Asn 
i 5 10 15 



Phe Leu Phe Trp Leu Cys Gly He Leu He Leu Ala Leu Ala He Trp 
20 25 30 



Val Arg Val Ser Asn Asp Ser Gin Ala He Phe Gly Ser Glu Asp Val 
35 40 45 



Gly Ser Ser Ser Tyr Val Ala Val Asp He Leu He Ala Val Gly Ala 
50 55 60 

He He Met He Leu Gly Phe Leu Gly Cys Cys Gly Ala He Lys Glu 
65 70 75 80 

Ser Arg Cys Met Leu Leu Leu Phe Phe He Gly Leu Leu Leu He Leu 
85 90 95 



Leu Leu Gin Val Ala Thr Gly He Leu Gly Ala Val Phe Lys Ser Lys 
100 105 HO 



Ser Asp Arg He Val Asn Glu Thr Leu Tyr Glu Asn Thr Lys Leu Leu 
115 120 125 

Ser Ala Thr Gly Glu Ser Glu Lys Gin Phe Gin Glu Ala He He Val 
130 135 140 



Phe Gin Glu Glu Phe Lys Cys Cys Gly Leu Val Asn Gly Ala Ala Asp 
145 150 155 160 

Trp Gly Asn Asn Phe Gin His Tyr Pro Glu Leu Cys Ala Cys Leu Asp 
165 170 175 



Lys Gin Arg Pro Cys Gin Ser Tyr Asn Gly Lys Gin Val Tyr Lys Glu 
180 185 190 



Thr Cys He Ser Phe He Lys Asp Phe Leu Ala Lys Asn Leu lie lie 
195 200 205 



Val He Gly He Ser Phe Gly Leu Ala Val lie Glu lie Leu Gly Leu 
210 215 220 



Val Phe Ser Met Val Leu Tyr Cys Gin He Gly Asn Lys 
225 230 235 
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<210> 84 

<211> 202 

<212> PRT 

<213> Homo Sapiens 

<400> 84 

Met Cys Thr Gly Gly Cys Ala Arg Cys Leu Gly Gly Thr Leu lie Pro 
1 5 10 15 



Leu Ala Phe Phe Gly Phe Leu Ala Asn He Leu Leu Phe Phe Pro Gly 
20 25 30 



Gly Lys Val He Asp Asp Asn Asp His Leu Ser Gin Glu He Trp Phe 
35 40 45 



Phe Gly Gly He Leu Gly Ser Gly Val Leu Met He Phe Pro Ala Leu 
, 50 55 60 



Val Phe Leu Gly Leu Lys Asn Asn Asp Cys Cys Gly Cys Cys Gly Asn 
65 ~ 70 75 80 



Glu Gly Cys Gly Lys Arg Phe Ala Met Phe Thr Ser Thr He Phe Ala 
85 90 95 



Val Val Gly Phe Leu Gly Ala Gly Tyr Ser Phe He He Ser Ala He 
100 105 HO 



Ser He Asn Lys Gly Pro Lys Cys Leu Met Ala Asn Ser Thr Trp Gly 
115 120 125 



Tyr Pro Phe His Asp Gly Asp Tyr Leu Asn Asp Glu Ala Leu Trp Asn 
130 135 140 



Lys Cys Arg Glu Pro Leu Asn Val Val Pro Trp Asn Leu Thr Leu Phe 
145 150 155 160 



Ser He Leu Leu Val Val Gly Gly He Gin Met Val Leu Cys Ala He 
165 170 175 



Gin Val Val. Asn Gly Leu Leu Gly Thr Leu Cys Gly Asp Cys Gin Cys 
180 185 190 



Cys Gly Cys Cys Gly Gly Asp Gly Pro Val 
195 200 



<210> 85 
<211> 677 
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<212> PRT 

<213> Homo Sapiens 

<400> 85 

Met Gin Pro Thr Leu Leu Leu Ser Leu Leu Gly Ala Val Gly Leu Ala 
15 10 15 

Ala Val Asn Ser Met Pro Val Asp Asn Arg Asn His Asn Glu Gly Met 
20 25 30 

Val Thr Arg Cys He He Glu Val Leu Ser Asn Ala Leu Ser Lys Ser 
35 40 45 

Ser Ala Pro Pro He Thr Pro Glu Cys Arg Gin Val Leu Lys Thr Ser 
50 55 60 

Arg Lys Asp Val Lys Asp Lys Glu Thr Thr Glu Asn Glu Asn Thr Lys 
65 *" 70 75 80 

Phe Glu Val Arg Leu Leu Arg Asp Pro Ala Asp Ala Ser Glu Ala His 
85 90 95 

Glu Ser Ser Ser Arg Gly Glu Ala Gly Ala Pro Gly Glu Glu Asp He 
100 105 HO 

Gin Gly Pro Thr Lys Ala Asp Thr Glu Lys Trp Ala Glu Gly Gly Gly 
115 120 125 

His Ser Arg Glu Arg Ala Asp Glu Pro Gin Trp Ser Leu Tyr Pro Ser 
130 ~ ~ 135 140 

Asp Ser Gin Val Ser Glu Glu Val Lys Thr Arg His Ser Glu Lys Ser 
145 150 155 160 

Gin Arg Glu Asp Glu Glu Glu Glu Glu Gly Glu Asn Tyr Gin Lys Gly 
165 170 175 

Glu Arg Gly Glu Asp Ser Ser Glu Glu Lys His Leu Glu Glu Pro Gly 
180 185 190 

Glu Thr Gin Asn Ala Phe Leu Asn Glu Arg Lys Gin Ala Ser Ala He 
195 200 205 



Lys Lys Glu Glu Leu Val Ala Arg Ser Glu Thr His Ala Ala Gly His 
210 215 220 
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Ser 
225 



Gin Glu Lys Thr His Ser Arg Glu Lys Ser Ser Gin Glu Ser Gly 



230 235 



240 



Glu Glu Ala Gly Ser Gin Glu Asn His Pro Gin Glu Ser Lys Gly Gin 
245 250 255 

Pro Arg Ser Gin Glu Glu Ser Glu Glu Gly Glu Glu Asp Ala Thr Ser 
260 265 270 



Glu val Asp Lys Arg Arg Thr Arg Pro Arg His His His Gly Arg Ser 
275 280 285 

Arg Pro Asp Arg Ser Ser Gin Gly Gly Ser Leu Pro Ser Glu Glu Lys 
290 " 295 30° 



Gly His Pro Gin Gin Gin Ser Glu Glu Ser Asn Val Ser Met Ala Ser 
305 310 315 320 

Leu Gly Glu Lys Arg Asp His His Ser Thr His Tyr Arg Ala Ser Glu 



325 



330 335 



Glu Glu Pro Glu Tyr Gly Glu Glu lie Lys Gly Tyr Pro Gly Val Gin 
340 345 350 

Ala Pro Glu Asp Leu Glu Trp Glu Arg Tyr Arg Gly Arg Gly Ser Glu 
355 360 365 

Glu Tyr Arg Ala Pro Arg Pro Gin Ser Glu Glu Ser Trp Asp Glu Glu 
370 375 380 

Asp Lys Arg Asn Tyr Pro Ser Leu Glu Leu Asp Lys Met Ala His Gly 

Tvr Gly Glu Glu Ser Glu Glu Glu Arg Gly Leu Glu Pro Gly Lys Gly 
-> ■ 410 415 



405 



Arg His His Arg Gly Arg Gly Gly Glu Pro Arg Ala Tyr Phe Met Ser 
420 «5 430 

Asp Thr Arg Glu Glu Lys Arg Phe Leu Gly Glu Gly His His Arg Val 
435 440 445 

Gin Glu Asn Gin Met Asp Lys Ala Arg Arg His Pro Gin Gly Ala Trp 
450 455 460 

Lys Glu Leu Asp Arg Asn Tyr Leu Asn Tyr Gly Glu Glu Gly Ala Pro 
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465 



470 475 «80 



Gly Lys Trp Gin Gin Gin Gly Asp Leu Gin Asp Thr- Lys Glu Asn Arg 



Glu Glu Ala Arg Phe Gin Asp Lys Gin Tyr Ser Ser His His Thr Ala 
500 505 50 

Glu Lys Arg Lys Arg Leu Gly Glu Leu Phe Asn Pro Tyr Tyr Asp Pro 
• 520 5Zb 



515 



Leu Gin Trp Lys Ser Ser His Phe Glu Arg Arg Asp Asn Met Asn Asp 
530 



535 540 



Asn Phe Leu Glu Gly Glu Glu Glu Asn Glu Leu Thr Leu Asn Glu Lys 
545 550 555 

Asn Phe Phe Pro Glu Tyr Asn Tyr Asp Trp Trp Glu Lys Lys Pro Phe 



565 



570 



Ser Glu Asp Val Asn Trp Gly Tyr Glu Lys Arg Asn Leu Ala Arg Val 
580 585 ~~ 

Pro Lys Leu Asp Leu Lys Arg Gin Tyr Asp Arg Val Ala Gin Leu Asp 



595 



600 



Gin Leu Leu His Tyr Arg Lys Lys Ser Ala Glu Phe Pro Asp Phe Tyr 
610 515 520 

Asp ser Glu Glu Pro Val Ser Thr His Gin Glu Ala Glu Asn Glu Lys 



625 



630 



Asp Arg Ala Asp Gin Thr Val Leu Thr Glu Asp Glu Lys Lys Glu Leu 
645 "° 

Glu Asn Leu Ala Ala Met Asp Leu Glu Leu Gin Lys He Ala Glu Lys 
660 665 



Phe Ser Gin Arg Gly 
675 



<210> 86 

<211> 631 

<212> PRT 

<213> Homo Sapiens 



<400> 86 



WO 2004/073657 



186/282 



PCT/US2004/005455 



Met Lys Leu Leu Cys Glu Gly Leu Lys Gin Pro Asn Cys Val Leu Gin 
1 5 10 15 



Thr Leu Arg Trp Tyr Arg Cys Leu lie Ser Ser Ala Ser Cys Gly Ala 
20 25 30 



Leu Ala Ala Val Leu Ser Thr Ser Gin Trp Leu Thr Glu Leu Glu Phe 
35 40 45 



Ser Glu Thr Lys Leu Glu Ala Ser Ala Leu Lys Leu Leu Tyr Gly Gly 
50 " 55 60 



Leu Lys Asp Pro Asn Cys Lys Leu Gin Lys Leu Asn Leu Gin Phe Ser 
65 ~ 70 75 80 

Leu Ser Val Thr Ala Ala Lys Leu Pro Val Gly Met Val Gly Asn Cys 
85 90 95 



Ser Gly Phe Ser Gly Ser Leu Val Gin Ser His Phe Gly Tyr Cys Gin 
100 105 HO 



Asp Ser Ser Phe Lys Cys Asp Leu Cys Lys Leu Leu Trp Pro Ser Thr 
115 120 125 



Arg Val Ala Ala Ala Lys Asp Cys Gly Ser Pro Lys Ser Phe Leu Ser 
130 135 140 

Glu Gly Leu Asn Trp Ala Gly Arg Leu Glu Ala Val Glu Glu Val Leu 
145 150 155 160 

Gly Leu Gly Val Leu Val Gin Pro Gly Asp Pro Ala Ser Gin Gly Gly 
165 170 175 



Gly His Cys Glu Asn Tyr Gly Ser Phe Arg Asp Leu Val Asp Leu Glu 
180 185 190 



Val Lys Ala Glu Pro Ser Leu Arg Lys Gly Gly Met Asp Leu Gin Arg 
195 " 200 205 



Pro Thr Leu Gin Val Val Leu Leu Cys Lys lie Phe Ser Leu Lys Leu 
2io 215 220 

Phe Leu Phe lie Ala Leu Pro Asn Ser Pro Gly Gin Val Ser Val Val 
225 230 235 240 
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Gin Val Thr lie Pro Asp Gly Phe Val Asn Val Thr Val Gly Ser Asn 
245 ^ 250 255 



Val Thr Leu lie Cys lie Tyr Thr Thr Thr Val Ala Ser Arg Glu Gin 
260 265 270 



Leu Ser He Gin Trp Ser Phe Phe His Lys Lys Glu Met Glu Pro He 
275 280 285 



Ser Ser Pro Trp Glu Glu Gly Lys Trp Pro Asp Val Glu Ala Val Lys 
290 295 300 



Gly Thr Leu Asp Gly Gin Gin Ala Glu Leu Gin He Tyr Phe Ser Gin 
305 310 315 * 320 



Gly Gly Gin Ala Val Ala He Gly Gin Phe Lys Asp Arg He Thr Gly 
325 330 335 



Ser Asn Asp Pro Gly Asn Ala Ser He Thr He Ser His Met Gin Pro 
340 345 350 



Ala Asp Ser Gly He Tyr He Cys Asp Val Asn Asn Pro Pro Asp Phe 
355 360 365 



Leu Gly Gin Asn Gin Gly He Leu Asn Val Ser Val Leu Val Lys Pro 
370 375 380 



Ser Lys Pro Leu Cys Ser Val Gin Gly Arg Pro Glu Thr Gly His Thr 
385 390 395 400 



He Ser Leu Ser Cys Leu Ser Ala Leu Gly Thr Pro Ser Pro Val Tyr 
405 410 415 



Tyr Trp His Lys Leu Glu Gly Arg Asp He Val Pro Val Lys Glu Asn 
420 425 430 



Phe Asn Pro Thr Thr Gly lie Leu Val He Gly Asn Leu Thr Asn Phe 
435 440 445 



Glu Gin Gly Tyr Tyr Gin Cys Thr Ala He Asn Arg Leu Gly Asn Ser 
450 455 460 



Ser Cys Glu He Asp Leu Thr Ser Ser His Pro Glu Val Gly He He 
465 470 475 480 



Val Gly Ala Leu He Gly Ser Leu Val Gly Ala Ala lie lie He Ser 
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485 490 495 

Val Val Cys Phe Ala Arg Asn Lys Ala Lys Ala Lys Ala Lys Glu Arg 
500 505 510 

Asn Ser Lys Thr lie Ala Glu Leu Glu Pro Met Thr Lys lie Asn Pro 
515 520 525 

Arg Gly Glu Ser Glu Ala Met Pro Arg Glu Asp Ala Thr Gin Leu Glu 
530 535 540 

Val Thr Leu Pro Ser Ser lie His Glu Thr Gly Pro Asp Thr lie Gin 
545 550 555 560 

Glu Pro Asp Tyr Glu Pro Lys Pro Thr Gin Glu Pro Ala Pro Glu Pro 
565 570 575 

Ala Pro Gly Ser Glu Pro Met Ala Val Pro Asp Leu Asp He Glu Leu 
. 580 585 590 

Glu Leu Glu Pro Glu Thr Gin Ser Glu Leu Glu Pro Glu Pro Glu Pro 
595 600 605 

Glu Pro Glu Ser Glu Pro Gly Val Val Val Glu Pro Leu Ser Glu Asp 
610 615 620 

Glu Lys Gly Val Val Lys Ala 
625 630 



<210> 87 

<211> 413 

<212> PRT 

<213> Homo Sapiens 

<400> 87 

Met Val Phe Ala Phe Trp Lys Val Phe Leu He Leu Ser Cys hen Ala 
1 5 10 15 

Glv Gin Val Ser Val Val Gin Val Thr He Pro Asp Gly Phe Val Asn 
20 25 30 

Val Thr Val Gly Ser Asn Val Thr Leu He Cys He Tyr Thr Thr Thr 
35 40 45 

Val Ala Ser Arg Glu Gin Leu Ser He Gin Trp Ser Phe Phe His Lys 
50 55 60 



-ion 
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Ly s Qlu Met Glu Pro lie Ser Ser Pro Trp Glu Glu Gly Lys Trp Pro 
65 70 75 

As p val Glu Ala Val Lys Gly Thr Leu Asp Gly Gin Gin Ala Glu Leu 
85 90 

Gin lie Tyr Phe Ser Gin Gly Gly Gin Ala Val Ala He Gly Gin Phe 
100 105 



Lys Asp Arg He Thr Gly Ser Asn Asp Pro Gly Asn Ala Ser He Thr 
115 120 

He Ser His Met Gin Pro Ala Asp Ser Gly He Tyr He Cys Asp Val 
130 135 

AS n Asn Pro Pro Asp Phe Leu Gly Gin Asn Gin Gly He Leu Asn Val 
145 150 155 

S er val Leu Val Lys Pro Ser Lys Pro Leu Cys Ser Val Gin Gly Arg 
165 170 

Pro Glu Thr Gly His Thr He Ser Leu Ser Cys Leu Ser Ala Leu Gly 
180 185 

Thr Pro Ser Pro Val Tyr Tyr Trp His Lys Leu Glu Gly Arg Asp He 
19S 200 zu:> 

val Pro val Lys Glu Asn Phe Asn Pro Thr Thr Gly He Leu Val He 
210 215 2 

Gly Asn Leu Thr Asn Phe Glu Gin Gly Tyr Tyr Gin Cys Thr Ala He 
225 230 " 5 

A sn Arg Leu Gly Asn Ser Ser Cys Glu He Asp Leu Thr Ser Ser His 
245 250 

Pro Glu Val Gly He He Val Gly Ala Leu He Gly Ser Leu Val Gly 
260 265 

Ala Ala He He He Ser Val Val Cys Phe Ala Arg Asn Lys Ala Lys 
275 280 

Ala Lys Ala Lys Glu Arg Asn Ser Lys Thr He Ala Glu Leu Glu Pro 
290 295 
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Met Thr Lys He Asn Pro Arg Gly Glu Ser Glu Ala Met Pro Arg Glu 

305 310 315 320 



Asp Ala Thr Gin Leu Glu Val Thr Leu Pro Ser Ser He His Glu Thr 
325 330 335 



Gly Pro Asp Thr He Gin Glu Pro Asp Tyr Glu Pro Lys Pro Thr Gin 
340 345 350 



Glu Pro Ala Pro Glu Pro Ala Pro Gly Ser Glu Pro Met Ala Val Pro 
355 360 365 



Asp Leu Asp He Glu Leu Glu Leu Glu Pro Glu Thr Gin Ser Glu Leu 
370 375 380 



Glu Pro Glu Pro Glu Pro Glu Pro Glu Ser Glu Pro Gly Val Val Val 
385 390 395 400 



Glu Pro Leu Ser Glu Asp Glu Lys Gly Val Val Lys Ala 
405 410 



<210> 88 

<211> 397 

<212> PRT 

<213> Homo Sapiens 

<4 00> 88 

Met Arg Ser Pro Ser Ala Ala Trp Leu Leu Gly Ala Ala lie Leu Leu 
15 10 15 



Ala Ala Ser Leu Ser Cys Ser Gly Thr He Gin Gly Thr Asn Arg Ser 
20 25 30 



Ser Lys Gly Arg Ser Leu He Gly Lys Val Asp Gly Thr Ser His Val 
35 40 45 



Thr Gly Lys Gly Val Thr Val Glu Thr Val Phe Ser Val Asp Glu Phe 
50 55 60 



Ser Ala Ser Val Leu Thr Gly Lys Leu Thr Thr Val Phe Leu Pro He 
65 70 75 80 



Val Tyr Thr He Val Phe Val Val Gly Leu Pro Ser Asn Gly Met Ala 
85 90 95 



Leu Trp 



Val Phe 
100 



Leu Phe Arg Thr Lys Lys Lys His Pro Ala Val He 
105 110 
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Tyr Met Ala Asn Leu Ala Leu Ala Asp Leu Leu Ser Val He Trp Phe 
115 120 125 

Pro Leu Lys He Ala Tyr His He His Ala Asn Asn Trp He Tyr Gly 
130 135 140 

Glu Ala Leu Cys Asn Val Leu He Gly Phe Phe Tyr Gly Asn Met Tyr 
145 150 155 16° 

Cys Ser He Leu Phe Met Thr Cys Leu Ser Val Gin Arg Tyr Trp Val 
165 170 175 

He Val Asn Pro Met Gly His Ser Arg Lys Lys Ala Asn He Ala He 
180 185 I 90 

Gly He Ser Leu Ala He Trp Leu Leu He Leu Leu Val Thr He Pro 
195 200 205 

Leu Tyr Val Val Lys Gin Thr He Phe He Pro Ala Leu Asn He Thr 
210 215 220 

Thr Cys His Asp Val Leu Pro Glu Gin Leu Leu Val Gly Asp Met Phe 
225 230 235 240 

Asn Tyr Phe Leu Ser Leu Ala He Gly Val Phe Leu Phe Pro Ala Phe 
245 250 255 

Leu Thr Ala Ser Ala Tyr Val Leu Met He Arg Met Leu Arg Ser Ser 
260 265 270 

Ala Met Asp Glu Asn Ser Glu Lys Lys Arg Lys Arg Ala He Lys Leu 
275 280 285 

He Val Thr Val Leu Ala Met Tyr Leu He Cys Phe Thr Pro Ser Asn 
290 295 300 



Leu Leu Leu Val Val His Tyr Phe Leu He Lys Ser Gin Gly Gin Ser 
305 



310 315 320 



His Val Tyr Ala Leu Tyr He Val Ala Leu Cys Leu Ser Thr Leu Asn 
325 330 335 

Ser Cys He Asp Pro Phe Val Tyr Tyr Phe Val Ser His Asp Phe Arg 
340 345 350 
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Asp His Ala Lys Asn Ala Leu Leu Cys Arg Ser Val Arg Thr Val Lys 
355 360 365 



Gin Met Gin Val Ser Leu Thr Ser Lys Lys His Ser Arg Lys Ser Ser 
370 375 380 



Ser Tyr Ser Ser Ser Ser Thr Thr Val Lys Thr Ser Tyr 
385 390 395 



<210> 89 

<211> 1560 

<212> PRT 

<213> Homo Sapiens 

<400> 89 

Met Pro Cys Ala Gin Arg Ser Trp Leu Ala Asn Leu Ser Val Val Ala 
15 10 15 



Gin Leu Leu Asn Phe Gly Ala Leu Cys Tyr Gly Arg Gin Pro Gin Pro 
20 25 ~ 30 



Gly Pro Val Arg Phe Pro Asp Arg Arg Gin Glu His Phe lie Lys Gly 
35 40 45 



Leu Pro Glu Tyr His Val Val Gly Pro Val Arg Val Asp Ala Ser Gly 
50 55 60 



His Phe Leu Ser Tyr Gly Leu His Tyr Pro lie Thr Ser Ser Arg Arg 
65 70 75 80 



Lys Arg Asp Leu Asp Gly Ser Glu Asp Trp Val Tyr Tyr Arg lie Ser 
85 90 " 95 



His Glu Glu Lys Asp Leu Phe Phe Asn Leu Thr Val Asn Gin Gly Phe 
100 105 no 



Leu Ser Asn Ser Tyr He Met Glu Lys Arg Tyr Gly Asn Leu Ser His 
115 120 ~ 125 



Val Lys Met Met Ala Ser Ser Ala Pro Leu Cys His Leu Ser Gly Thr 
130 135 " 140 



Val Leu Gin Gin Gly Thr Arg Val Gly Thr Ala Ala Leu Ser Ala Cys 
145 150 155 160 



His Gly Leu Thr Gly Phe Phe Gin Leu Pro His Gly Asp Phe Phe He 
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01o „ vai LVS W s Hi, « - •! -» « ^ "J "° 
180 x 

^ „ « m„ Thr Lvs Glu Pro Thr Cys 
lie val Tyr Arg Arg Gin Lys Val Pro Glu Thr Lys ^ 

195 ^ uu 

, « ri, <! P r Gin Lys Gin Glu Leu Trp Arg 
Gly Leu Lys Asp Ser Val Asn He Ser Gin y ^ 

210 215 

o=>- ara Ser Leu Ser Arg Arg 
Glu Lys Trp Glu Arg His Asn Leu Pro Ser Arg ^ 

225 230 

,, ! m« Thr Leu Val val Ala Asp Thr 
ser lie Ser Lys Glu Arg Trp Val Glu Thr Leu ^ 
245 

• m cor Glu Asn Val Glu Ser Tyr He Leu 
Met lie Glu Tyr His Gly Ser Glu Asn va ^ 

Phe His Asn Pro Ser He Gly 



hvs Met lie v,xu — « 

260 



M-f Thr Gly Leu Phe wis m»u «w 

Thr He Met Asn Met Val Ttir tay 2Q5 

275 280 

n He His lie Val Val Val Arg Leu He Leu Leu Glu Glu Glu 
Asn Ala He His xxe voj. 30Q 

290 295 

oxu ox, m » w n. v.! Hi- - « S Ly * " *" S 

305 310 

Tlo acm Pro lys Ser Asp Leu Asn Pro 
Phe Cys Lys Trp Gin Lys Ser He Asn Pro Ly ^ 
325 

val Hi, His M p « "a « - « » « «• - £ <*° "* 
340 J * 3 

« Glu Thr Leu Gly Leu Ser His Leu Ser Gly 

Gly Phe Asn Arg Pro Cys Glu Thr ue ^ 
355 . 360 

• Tv „ <;„r Cvs Asn He Asn Glu Asp Ser Gly Leu 
Met Cys Gin Pro Has Arg Ser Cys Asn ^ 

370 37 

Pre l=« ^ ~ ~ n- »» - G1 » « S " iS My S 

385 390 

«. Hi, « «V «. - - « * s S *" ~ ^ « "° 

405 



/ 
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Tyr lie Met Ser Arg Gin Leu Gin Tyr Asp Pro Thr Pro Leu Thr Trp 
420 425 430 



Ser Lys Cys Ser Glu Glu Tyr lie Thr Arg Phe Leu Asp Arg Gly Trp 
435 440 445 



Gly Phe Cys Leu Asp Asp lie Pro Lys Lys Lys Gly Leu Lys Ser Lys 
450 455 460 



Val lie Ala Pro Gly Val lie Tyr Asp Val His His Gin Cys Gin Leu 
465 470 475 480 



Gin Tyr Gly Pro Asn Ala Thr Phe Cys Gin Glu Val Glu Asn Val Cys 
485 490 495 



Gin Thr Leu Trp Cys Ser Val Lys Gly Phe Cys Arg Ser Lys Leu Asp 
500 505. 510 



Ala Ala Ala Asp Gly Thr Gin Cys Gly Glu Lys Lys Trp Cys Met Ala 
515 520 525 



Gly Lys Cys lie Thr Val Gly Lys Lys Pro Glu Ser lie Pro Gly Gly 
530 535 540 



Trp Gly Arg Trp Ser Pro Trp Ser His Cys Ser Arg Thr Cys Gly Ala 
545 550 555 560 



Gly Val Gin Ser Ala Glu Arg Leu Cys Asn Asn Pro Glu Pro Lys Phe 
565 570 575 



Gly Gly Lys Tyr Cys Thr Gly Glu Arg Lys Arg Tyr Arg Leu Cys Asn 
580 585 590 



Val His Pro Cys Arg Ser Glu Ala Pro Thr Phe Arg Gin Met Gin Cys 
595 600 605 



Ser Glu Phe Asp Thr Val Pro Tyr Lys Asn Glu Leu Tyr His Trp Phe 
610 615 620 



Pro lie Phe Asn Pro Ala His Pro Cys Glu Leu Tyr Cys Arg Pro lie 
625 630 635 * " ~ 640 



Asp Gly Gin Phe Ser Glu Lys Met Leu Asp Ala Val He Asp Gly Thr 
645 650 655 
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Pro Cys Phe Glu Gly Gly Asn Ser Arg Asn Val Cys He Asn Gly lie 
660 665 670 

Cys Lys Met Val Gly Cys Asp Tyr Glu lie Asp Ser Asn Ala Thr Glu 
675 680 685 

Asp Arg Cys Gly Val Cys Leu Gly Asp Gly Ser Ser Cys Gin Thr Val 
690 A S95 • 700 



Arg Lys Met Phe Lys Gin Lys Glu Gly Ser Gly Tyr Val Asp lie Gly 

710 715 720 



705 



Leu lie Pro Lys Gly Ala Arg Asp lie Arg Val Met Glu lie Glu Gly 
725 730 735 



Ala Gly Asn Phe Leu Ala lie Arg Ser Glu Asp Pro Glu Lys Tyr Tyr 
740 745 750 



Leu Asn Gly Gly Phe lie lie Gin Trp Asn Gly Asn Tyr Lys Leu Ala 
755 760 765 

Gly Thr Val Phe Gin Tyr Asp Arg Lys Gly Asp Leu Glu Lys Leu Met 
770 775 780 

Ala Thr Gly Pro Thr Asn Glu Ser Val Trp He Gin Leu Leu Phe Gin 
785 790 795 - 800 

Val Thr Asn Pro Gly He Lys Tyr Glu Tyr Thr He Gin Lys Asp Gly 
805 810 815 



Leu Asp Asn Asp Val Glu Gin Met Tyr Phe Trp Gin Tyr Gly His Trp 
820 825 830 



Thr Glu Cys Ser Val Thr Cys Gly Thr Gly He Arg Arg Gin Thr Ala 
835 840 845 



His Cys He Lys Lys Gly Arg Gly Met Val Lys Ala Thr Phe Cys Asp 
850 855 860 



Pro Glu Thr Gin Pro Asn Gly Arg Gin Lys Lys Cys His Glu Lys Ala 
865 870 875 880 



Cys Pro Pro Arg Trp Trp Ala Gly Glu Trp Glu Ala Cys Ser Ala Thr 
885 890 895 
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Cys Gly Pro His Gly Glu Lys Lys Arg Thr Val Leu Cys lie Gin Thr 
900 905 910 



Met Val Ser Asp Glu Gin Ala Leu Pro Pro Thr Asp Cys Gin His Leu 
915 920 925 



Leu Lys Pro Lys Thr Leu Leu Ser Cys Asn Arg Asp lie Leu Cys Pro 
930 935 940 



Ser Asp Trp Thr Val Gly Asn Trp Ser Glu Cys Ser Val Ser Cys Gly 
945 ~ 950 955 960 



Gly Gly Val Arg lie Arg Ser Val Thr Cys Ala Lys Asn His Asp Glu 
965 970 975 



Pro Cys Asp Val Thr Arg Lys Pro Asn Ser Arg Ala Leu Cys Gly Leu 
980 985 990 



Gin Gin Cys Pro Ser Ser Arg Arg Val Leu Lys Pro Asn Lys Gly Thr 
995 1000 1005 



lie Ser Asn Gly Lys Asn Pro Pro Thr Leu Lys Pro Val Pro Pro 
1010 1015 1020 



Pro Thr Ser Arg Pro Arg Met Leu Thr Thr Pro Thr Gly Pro Glu 
1025 1030 1035 



Ser Met Ser Thr Ser Thr Pro Ala lie Ser Ser Pro Ser Pro Thr 
1040 1045 1050 



Thr Ala Ser Lys Glu Gly Asp Leu Gly Gly Lys Gin Trp Gin Asp 
1055 1060 1065 



Ser Ser Thr Gin Pro Glu Leu Ser Ser Arg Tyr Leu He Ser Thr 
1070 1075 1080 



Gly Ser Thr Ser Gin Pro He Leu Thr Ser Gin Ser Leu Ser He 
1085 1090 1095 



Gin Pro Ser Glu Glu Asn Val Ser Ser Ser Asp Thr Gly Pro Thr 
1100 1105 1110 



Ser Glu Gly Gly Leu Val Ala Thr Thr Thr Ser Gly Ser Gly Leu 
1115 1120 1125 



Ser Ser Ser Arg Asn Pro He Thr Trp Pro Val Thr Pro Phe Tyr 
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1130 1«5 

Asn Thr Leu Thr Lys Gly Pro Glu Met Olu He His Ser Gly Ser 



1150 



1145 

Gly Glu Glu Arg Glu Gin Pro Glu Asp Lys Asp Glu Ser Asn Pro 
1160 1170 

Val He Trp Thr Lys lie Arg Val Pro Gly Asn Asp Ala Pro Val 
1175 H80 1185 



Glu Ser Thr Glu Met Pro Leu Ala Pro Pro Leu Thr^ Pro Asp Leu 
1190 



1195 I 200 



Ser Arg Glu Ser Trp Trp Pro Pro Phe Ser Thr Val Met Glu Gly 
1205 1210 1215 

L eu Leu Pro Ser Gin Arg Pro Thr Thr Ser Glu Thr Gly Thr Pro 
1220 1225 123° 

Arg Val Glu Gly Met Val Thr Glu Lys Pro Ala Asn Thr Leu Leu 
1235 1240 12« 

Pro Leu Gly Gly Asp His Gin Pro Glu Pro Ser Gly Lys Thr Ala 
1250 1255 1260 



Asn Arg Asn His Leu Lys Leu Pro Asn Asn Met Asn Gin Thr Lys 
1265 1270 1275 

Ser Ser Glu Pro Val Leu Thr Glu Glu Asp Ala Thr Ser Leu He 
1280 1285 129° 

Thr Glu Gly Phe Leu Leu Asn Ala Ser Asn Tyr Lys Gin Leu Thr 



1295 



1300 1305 



Asn Gly His Gly Ser Ala His Trp He Val Gly Asn Trp Ser Glu 
1310 "IS I 320 

Cys Ser Thr Thr Cys Gly Leu Gly Ala Tyr Trp Lys Arg Val Glu 
1325 1330 1335 

Cys Thr Thr Gin Met Asp Ser Asp Cys Ala Ala lie Gin Arg Pro 
1340 " 4S 1350 

Asp Pro Ala Lys Arg cys His Leu Arg Pro Cys Ala Gly Trp Lys 
1355 I 360 1365 
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Val Gly Asn Trp Ser Lys Cys Ser Arg Asn Cys Ser Gly Gly Phe 
1370 1375 1380 



Lys lie Arg Glu He Gin Cys Val Asp Ser Arg Asp His Arg Asn 
1385 1390 1395 



Leu Arg Pro Phe His Cys Gin Phe Leu Ala Gly He Pro Pro Pro 
1400 1405 1410 



Leu Ser Met Ser Cys Asn Pro Glu Pro Cys Glu Ala Trp Gin Val 
• 1415 1420 1425 



Glu Pro Trp Ser Gin Cys Ser Arg Ser Cys Gly Gly Gly Val Gin 
■ 1430 1435 1440 



Glu Arg Gly Val Phe Cys Pro Gly Gly Leu Cys Asp Trp Thr Lys 
1445 1450 1455 



Arg Pro Thr Ser Thr Met Ser Cys Asn Glu His Leu Cys Cys His 
1460 1465 1470 



Trp Ala Thr Gly Asn Trp Asp Leu Cys Ser Thr Ser Cys Gly Gly 
1475 • 1480 1485 



Gly Phe Gin Lys Arg He Val Gin Cys Val Pro Ser Glu Gly Asn 
1490 1495 1500 



Lys Thr Glu Asp Gin Asp Gin Cys Leu Cys Asp His Lys Pro Arg 
1505 1510 1515 



Pro Pro Glu Phe Lys Lys Cys Asn Gin Gin Ala Cys Lys Lys Ser 
1520 1525 1530 



Ala Asp Leu Leu Cys Thr Lys Asp Lys Leu Ser Ala Ser Phe Cys 
1535 1540 1545 



Gin Thr Leu Lys Ala Met Lys Lys Cys Ser Val Pro 
1550 1555 1560 



<210> 90 

<211> 96 

<212> PRT 

<213> Homo Sapiens 



<400> 



90 
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Met Cys Cys Thr Lys Ser Leu Leu Leu Ala Ala Leu Met Ser Val Leu 
x 5 10 15 

Leu Leu His Leu Cys Gly Glu Ser Glu Ala Ala Ser Asn Phe Asp Cys 
20 25 30 

Cys Leu Gly Tyr Thr Asp Arg He Leu His Pro Lys Phe He Val Gly 
35 40 45 

Phe Thr Arg Gin Leu Ala Asn Glu Gly Cys Asp He Asn Ala He He 
50 ~ 55 60 

Phe His Thr Lys Lys Lys Leu Ser Val Cys Ala Asn Pro Lys Gin Thr 
65 70 75 80 

Trp Val Lys Tyr He Val Arg Leu Leu Ser Lys Lys Val Lys Asn Met 
85 90 95 



<210> 91 

<211> 336 

<212> PRT 

<213> Homo Sapiens 

<400> 91 

Met Leu Gin Ser Leu Ala Gly Ser Ser Cys Val Arg Leu Val Glu Arg 
1 5 10 15 



His Arg Ser Ala Trp Cys Phe Gly Phe Leu Val Leu Gly Tyr Leu Leu 
20 25 30 

Tyr Leu Val Phe Gly Ala Val Val Phe Ser Ser Val Glu Leu Pro Tyr 
35 40 45 

Glu Asp Leu Leu Arg Gin Glu Leu Arg Lys Leu Lys Arg Arg Phe Leu 
50 55 SO 

Glu Glu His Glu Cys Leu Ser Glu Gin Gin Leu Glu Gin Phe Leu Gly 
65 70 75 80 

Arq Val Leu Glu Ala Ser Asn Tyr Gly Val Ser Val Leu Ser Asn Ala 
85 90 95 

Ser Gly Asn Trp Asn Trp Asp Phe Thr Ser Ala Leu Phe Phe Ala Ser 
100 " 105 HO 



Thr Val Leu Ser Thr Thr Gly Tyr Gly His Thr Val Pro Leu Ser Asp 
115 120 125 
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Gly Gly Lys Ala Phe Cys He He Tyr Ser Val He Gly He Pro Phe 
130 135 ' 140 



Thr Leu Leu Phe Leu Thr Ala Val Val Gin Arg lie Thr Val His Val 
145 150 155 160 



Thr Arg Arg Pro Val Leu Tyr Phe His He Arg Trp Gly Phe Ser Lys 
165 170 175 



Gin Val Val Ala He Val His Ala Val Leu Leu Gly Phe Val Thr Val 
180 185 190 



Ser Cys Phe Phe Phe He Pro Ala Ala Val Phe Ser Val Leu Glu Asp 
195 200 205 



Asp Trp Asn Phe Leu Glu Ser Phe Tyr Phe Cys Phe He Ser Leu Ser 
210 215 220 



Thr He Gly Leu Gly Asp Tyr Val Pro Gly Glu Gly Tyr Asn Gin Lys 
225 230 4 235 240 



Phe Arg Glu Leu Tyr Lys He Gly He Thr Cys Tyr Leu Leu Leu Gly 
245 250 255 



Leu He Ala Met Leu Val Val Leu Glu Thr Phe Cys Glu Leu His Glu 
260 265 270 



Leu Lys Lys Phe Arg Lys Met Phe Tyr Val Lys Lys Asp Lys Asp Glu 
275 280 285 



Asp Gin Val His He He Glu His Asp Gin Leu Ser Phe Ser Ser He 
290 295 300 



Thr Asp Gin Ala Ala Gly Met Lys Glu Asp Gin Lys Gin Asn Glu Pro 
305 310 315 320 



Phe Val Ala Thr Gin Ser Ser Ala Cys Val Asp Gly Pro Ala Asn His 
325 330 335 



<210> 92 

<211> 103 

<212> PRT 

<213> Homo Sapiens 

<400> 92 
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Met 81* Thr Thr Asn Gly Thr 61* Thr Trp Tyr Olu Ser Leu His Ala 
n 5 10 



Val Leu Lys Ala Leu Asn Ala Thr Leu His Ser Asn Leu Leu Cys Arg 
20 25 



Pro Gly Pro Gly Leu Gly Pro Asp Asn Gin Thr Glu Olu Arg Arg Ala 
35 40 4b 



Ser Leu Pro Gly Arg Asp Asp Asn Ser Tyr Met Tyr He Leu Phe Val 
• 55 60 



50 



Met Phe Leu Phe Ala Val Thr Val Gly Ser Leu lie Leu Gly Tyr Thr 

70 75 



65 



Arg Ser 



Arg Lys Val Asp Lys Arg Ser Asp Pro Tyr His Val Tyr lie 
85 90 



Lys Asn Arg Val Ser Met He 
100 



<210> 93 

<211> 4590 

<212> PRT 

<213> Homo Sapiens 

<400> 93 

Met Gly Arg His Leu Ala Leu Leu Leu Leu Leu Leu Leu Leu Phe Gin 

His Phe Gly Asp Ser Asp Gly Ser Gin Arg Leu Glu Gin Thr Pro Leu 
20 25 

Gin Phe Thr His Leu Glu Tyr Asn Val Thr val Gin Glu Asn Ser Ala 
35 40 45 

Ala Lys Thr Tyr Val Gly His Pro Val Lys Met Gly Val Tyr lie Thr 
50 55 60 

His Pro Ala Trp Glu Val Arg Tyr Lys lie Val Ser Gly Asp Ser Glu 
65 70 75 

A sn Leu Phe Lys Ala Glu Glu Tyr lie Leu Gly Asp Phe Cys Phe Leu 
85 90 

Arg He Arg Thr Lys Gly Gly Asn Thr Ala He Le* Asn Arg Glu Val 



100 
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Lys Asp His Tyr Thr Leu lie Val Lys Ala Leu Glu Lys Asn Thr Asn 
115 120 125 



Val Glu Ala Arg Thr Lys Val Arg Val Gin Val Leu Asp Thr Asn Asp 
130 135 140 



Leu Arg Pro Leu Phe Ser Pro Thr Ser Tyr Ser Val Ser Leu Pro Glu 
145 150 155 160 



Asn Thr Ala lie Arg Thr Ser lie Ala Arg Val Ser Ala Thr Asp Ala 
165 170 175 



Asp lie Gly Thr Asn Gly Glu Phe Tyr Tyr Ser Phe Lys Asp Arg Thr 
180 185 190 



Asp Met Phe Ala lie His Pro Thr Ser Gly Val He Val Leu Thr Gly 
195 200 205 



Arg Leu Asp Tyr Leu Glu Thr Lys Leu Tyr Glu Met Glu He Leu Ala 
210 215 220 



Ala Asp Arg Gly Met Lys Leu Tyr Gly Ser Ser Gly He Ser Ser Met 
225 230 235 240 



Ala Lys Leu Thr Val His He Glu Gin Ala Asn Glu Cys Ala Pro Val 
245 250 255 



He Thr Ala Val Thr Leu Ser Pro Ser Glu Leu Asp Arg Asp Pro Ala 
260 265 270 



Tyr Ala He Val Thr Val Asp Asp Cys Asp Gin Gly Ala Asn Gly Asp 
275 280 285 



He Ala Ser Leu Ser He Val Ala Gly Asp Leu Leu Gin Gin Phe Arg 
290 295 300 



Thr Val Arg Ser Phe Pro Gly Ser Lys Glu Tyr Lys Val Lys Ala He 
305 310 315 320 



Gly Asp lie Asp Trp Asp Ser His Pro Phe Gly Tyr Asn Leu Thr Leu 
325 330 335 



Gin Ala Lys Asp Lys Gly Thr Pro Pro Gin Phe Ser Ser Val Lys Val 
340 345 350 
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lie His Val Thr Ser Pro Gin Phe Lys Ala Gly Pro Val Lys Phe Glu 
355 360 365 



Lys Asp Val Tyr Arg Ala Glu He Ser Glu Phe Ala Pro Pro Asn Thr 
370 375 380 



Pro Val Val Met Val Lys Ala lie Pro Ala Tyr Ser His Leu Arg Tyr 
385 390 395 400 



Val Phe Lys Arg Thr Pro Gly Lys Ala Lys Phe Ser Leu Asn Tyr Asn 
405 410 415 



Thr Gly Leu He Ser He Leu Glu Pro Val Lys Arg Gin Gin Ala Ala 
42 0 425 43 0 



His Phe Glu Leu Glu Val Thr Thr Ser Asp Arg Lys Ala Ser Thr Lys 
435 440 445 



Val Leu Val Lys Val Leu Gly Ala Asn Ser Asn Pro Pro Glu Phe Thr 
450 455 460 



Gin Thr Ala Tyr Lys Ala Ala Phe Asp Glu Asn Val Pro He Gly Thr 
465 470 475 480 



Thr He Met Ser Leu Ser Ala Val Asp Pro Asp Glu Gly Glu Asn Gly 
485 490 495 



Tyr Val Thr Tyr Ser He Ala Asn Leu Asn His Val Pro Phe Ala He 
500 505 510 



Asp His Phe Thr Gly Ala Val Ser Thr Ser Glu Asn Leu Asp Tyr Glu 
515 520 525 



Leu Met Pro Arg Val Tyr Thr Leu Arg He Arg Ala Ser Asp Trp Gly 
530 535 540 



Leu Pro Tyr Arg Arg Glu Val Glu Val Leu Ala Thr He Thr Leu Asn 
545 550 555 560 



Asn Leu Asn Asp Asn Thr Pro Leu Phe Glu Lys He Asn Cys Glu Gly 
565 570 575 



Thr lie Pro Arg Asp Leu Gly Val Gly Glu Gin He Thr Thr Val Ser 
580 585 590 



WO 2004/073657 PCT/US2004/005455 

204/282 

Ala He Asp Ala Asp Glu Leu Gin Leu Val Gin Tyr Gin He Glu Ala 
595 600 605 



Gly Asn Glu Leu Asp Leu Phe Ser Leu Asn Pro Asn Ser Gly Val Leu 
610 615 620 



Ser Leu Lys Arg Ser Leu Met Asp Gly Leu Gly Ala Lys Val Ser Phe 
625 630 635 640 



His Ser Leu Arg He Thr Ala Thr Asp Gly Glu Asn Phe Ala Thr Pro 
645 650 655 



Leu Tyr He Asn He Thr Val Ala Ala Ser His Lys Leu Val Asn Leu 
660 665 670 



Gin Cys Glu Glu Thr Gly Val Ala Lys Met Leu Ala Glu Lys Leu Leu 
675 680 685 



Gin Ala Asn Lys Leu His Asn Gin Gly Glu Val Glu Asp He Phe Phe 
690 695 700 



Asp Ser His Ser Val Asn Ala His He Pro Gin Phe Arg Ser Thr Leu 
705 • 710 715 720 



Pro Thr Gly He Gin Val Lys Glu Asn Gin Pro Val Gly Ser Ser Val 
725 730 735 



He Phe Met Asn Ser Thr Asp Leu Asp Thr Gly Phe Asn Gly Lys Leu 
740 745 750 



Val Tyr Ala Val Ser Gly Gly Asn Glu Asp Ser Cys Phe Met He Asp 
755 760 765 



Met Glu Thr Gly Met Leu Lys He Leu Ser Pro Leu Asp Arg Glu Thr 
770 775 780 



Thr Asp Lys Tyr Thr Leu Asn He Thr Val Tyr Asp Leu Gly He Pro 
785 790 795 800 



Gin Lys Ala Ala Trp Arg Leu Leu His Val Val Val Val Asp Ala Asn 
805 810 815 



Asp Asn Pro Pro Glu Phe Leu Gin Glu Ser Tyr Phe Val Glu Val Ser 
820 825 830 



Glu Asp Lys Glu Val His Ser Glu He He Gin Val Glu Ala Thr Asp 
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Lys Asp Leu Gly Pro Asn Gly His Val Thr Tyr Ser He Leu Thr Asp 
850 8SS 860 

Thr Asp Thr Phe Ser lie Asp Ser Val Tbr Gly Val Val Asn lie Ala 
865 870 875 880 

Arg Pro Leu Asp Arg Glu Leu Gin His Glu His Ser Leu Lys He Glu 
885 890 895 

Ala Arg Asp Gin Ala Arg Glu Glu Pro Gin Leu Phe Ser Thr Val Val 
900 905 91° 

Val Lys Val Ser Leu Glu Asp Val Asn Asp Asn Pro Pro Thr Phe He 
915 920 925 . 

Pro Pro Asn Tyr Arg Val Lys Val Arg Glu Asp Leu Pro Glu Gly Thr 
930 ' 935 940 

Val He Met Trp Leu Glu Ala His Asp Pro Asp Leu Gly Gin Ser Gly 
945 950 955 960 

Gin Val Arg Tyr Ser Leu Leu Asp His Gly Glu Gly Asn Phe Asp Val 
- 965 970 975 

Asp Lys Leu Ser Gly Ala Val Arg He Val Gin Gin Leu Asp Phe Glu 
980 985 990 

Lvs Lys Gin Val Tyr Asn Leu Thr val Arg Ala Lys Asp Lys Gly Lys 
1 995 1000 1005 

Pro Val Ser Leu Ser Ser Thr Cys Tyr Val Glu Val Glu Val Val 
1010 1015 1020 

Asp Val Asn Glu Asn Leu His Pro Pro Val Phe Ser Ser Phe Val 
1025 1030 1035 

Glu Lys Gly Thr Val Lys Glu Asp Ala Pro Val Gly Ser Leu Val 
1040 1045 1050 

Met Thr Val Ser Ala His Asp Glu Asp Ala Gly Arg Asp Gly Glu 
1055 1060 1065 



He Arg Tyr Ser He Arg Asp Gly Ser Gly Val Gly Val Phe Lys 
1070 1075 1080 
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lie Gly Glu Glu Thr Gly Val lie Glu Thr Ser Asp Arg Leu Asp 
1085 109Q 1095 



Arg Glu Ser Thr Ser His Tyr Trp Leu Thr Val Phe Ala Thr Asp 
1100 H05 1110 



Gin Gly Val Val Pro Leu Ser Ser Phe lie Glu lie Tyr He Glu 
1115 1120 1125 



Val Glu Asp Val Asn Asp Asn Ala Pro Gin Thr Ser Glu Pro Val 
1130 H35 1140 



Tyr Tyr Pro Glu He Met Glu Asn Ser Pro Lys Asp Val Ser Val 
1145 H50 1155 



Val Gin He Glu Ala Phe Asp Pro Asp Ser Ser Ser Asn Asp Lys 
1160 1165 H70 



Leu Met Tyr Lys He Thr Ser Gly Asn Pro Gin Gly Phe Phe Ser 
1175 H80 H85 



He His Pro Lys Thr Gly Leu He Thr Thr Thr Ser Arg Lys Leu 
1190 H95 1200 



Asp Arg Glu Gin Gin Asp Glu His He Leu Glu Val Thr Val Thr 
1205 1210 1215 



Asp Asn Gly Ser Pro Pro Lys Ser Thr He Ala Arg Val He Val 
1220 1225 1230 



Lys He Leu Asp Glu Asn Asp Asn Lys Pro Gin Phe Leu Gin Lys 
1235 1240 1245 



Phe Tyr Lys He Arg Leu Pro Glu Arg Glu Lys Pro Asp Arg Glu 
1250 1255 1260 



Arg Asn Ala Arg Arg Glu Pro Leu Tyr Arg Val He Ala Thr Asp 
1265 1270 1275 



Lys Asp Glu Gly Pro Asn Ala Glu He Ser Tyr Ser He Glu Asp 
1280 1285 1290 



Gly Asn Glu His Gly Lys Phe Phe He Glu Pro Lys Thr Gly Val 
1295 1300 1305 
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Val Ser Ser Lys Arg Phe Ser Ala Ala Gly fllu Tyr Asp lie Leu 
1310 I 3 " 1320 

Ser lie Lys Ala Val Asp Asn Gly Arg Pro Gin Lys Ser Ser Thr 
1325 "3° 

Thr Arg Leu His lie Glu Trp He Ser Lys Pro Lys Gin Ser Leu 
1340 1345 • 1350 

Glu Pro lie Ser Phe Glu Glu Ser Phe Phe Thr Phe Thr Val Met 
1355 1360 1365 

Glu Ser Asp Pro Val Ala His Met lie Gly Val lie Ser Val Glu 
1370 1375 1380 

Pro Pro Gly lie Pro Leu Trp Phe Asp lie Thr Gly Gly Asn Tyr 
1385 1390 "95 

Asp ser His Phe Asp Val Asp Lys Gly Thr Gly Thr lie lie Val 
1400 "OS I 410 

Ala Lys Pro Leu Asp Ala Glu Gin Lys Ser Asn Tyr Asn Leu Thr 
1415 1420 I 425 

val Glu Ala Thr Asp Gly Thr Thr Thr He Leu Thr Gin Val Phe 
1430 1435 1440 

He Lys val He Asp Thr Asn Asp His Arg Pro Gin Phe Ser Thr 
. 1445 1450 1455 

Ser Lys Tyr Glu Val Val lie Pro Glu Asp Thr Ala Pro Glu Thr 
1460 1465 1470 

Glu He Leu Gin He Ser Ala Val Asp Gin Asp Glu Lys Asn Lys 
1475 1480 1485 

Leu lie Tyr Thr Leu Gin Ser Ser Arg Asp Pro Leu Ser Leu Lys 
1490 1495 1500 

Lys Phe Arg Leu Asp Pro Ala Thr Gly Ser Leu Tyr Thr Ser Glu 
1505 1510 I 515 

Lys Leu Asp His Glu Ala Val Ser Pro Ala His Leu Thr Val Met 
1520 1525 1530 
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Val Arg Asp Gin Asp Val Pro Val Lys Arg Asn Phe Ala Arg He 
1535 1540 1545 

Val Val Asn Val Ser Asp Thr Asn Asp His Ala Pro Trp Phe Thr 
1550 1555 1560 

Ala Ser Ser Tyr Lys Gly Arg Val Tyr Glu Ser Ala Ala Val Gly 
156 5 1570 2575 

Ser Val Val Leu Gin Val Thr Ala Leu Asp Lys Asp Lys Gly Lys 
1580 1585 1590 

Asn Ala Glu Val Leu Tyr Ser He Glu Ser Gly Asn He Gly Asn 
1595 1600 i 6 05 

He Gly Asn Ser Phe Met He Asp Pro Val Leu Gly Ser He Lys 
1610 1615 i 6 2o 

Thr Ala Lys Glu Leu Asp Arg Ser Asn Gin Ala Glu Tyr Asp Leu 
1625 1630 1635 

Met Val Lys Ala Thr Asp Lys Gly Ser Pro Pro Met Ser Glu He 
1640 1645 1650 

Thr Ser Val Arg He Phe Val Thr He Ala Asp Asn Ala Ser Pro 
• 1655 1660 i 6 65 

Lys Phe Thr ser Lys Glu Tyr ser Val Glu Leu Ser Glu Thr Val 
1670 1675 1680 

Ser He Gly Ser Phe Val Gly Met Val Thr Ala His Ser Gin Ser 
1685 1690 1695 

Ser Val Val Tyr Glu lie Lys Asp Gly Asn Thr Gly Asp Ala Phe 
1700 1705 mo 

Asp He Asn Pro His Ser Gly Thr He He Thr Gin Lys Ala Leu 
1715 1720 1725 

Asp Phe Glu Thr Leu Pro He Tyr Thr Leu He He Gin Gly Thr 
1730 1735 i7 40 

Asn Met Ala Gly Leu Ser Thr Asn Thr Thr Val Leu Val His Leu 
1745 1750 17S5 



Asp Glu Asn Asp Asn Ala Pro Val Phe Met Gin Ala Glu 
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1760 1765 1770 



Thr Gly Leu He Ser Glu Ser Ala Ser He Asn Ser Val Val Leu 
1775 1780 1785 



Thr Asp Arg Asn Val Pro Leu Val lie Arg Ala Ala Asp Ala Asp 
1790 1795 1800 



Lys Asp Ser Asn Ala Leu Leu Val Tyr His He Val Glu Pro Ser 
1805 1810 1815 



Val His Thr Tyr Phe Ala He Asp Ser Ser Thr Gly Ala He His 
1820 1825 1830 



Thr Val Leu Ser Leu Asp Tyr Glu Glu Thr Ser He Phe His Phe 
1835 1840 1845 



Thr Val Gin Val His Asp Met Gly Thr Pro Arg Leu Phe Ala Glu 
1850 1855 1860 



Tyr Ala Ala Asn Val Thr Val His Val He Asp lie Asn Asp Cys 
1865 1870 1875 



Pro Pro Val Phe Ala Lys Pro Leu Tyr Glu Ala Ser Leu Leu Leu 
1880 1885 1890 



Pro Thr Tyr Lys Gly Val Lys Val He Thr Val Asn Ala Thr Asp 
1895 1900 1905 



Ala Asp Ser Ser Ala Phe Ser Gin Leu He Tyr Ser He Thr Glu 
1910 1915 1920 



Gly Asn He Gly Glu Lys Phe Ser Met Asp Tyr Lys Thr Gly Ala 
1925 1930 ~ 1935 



Leu Thr Val Gin Asn Thr Thr Gin Leu Arg Ser Arg Tyr Glu Leu 
1940 1945 1950 



Thr Val Arg Ala Ser Asp Gly Arg Phe Ala Gly Leu Thr Ser Val 
1955 1960 1965 



Lys He Asn Val Lys Glu Ser Lys Glu Ser His Leu Lys Phe Thr 
1970 1975 1980 



Gin Asp Val Tyr Ser Ala 
1985 



Val 
1990 



Val 



Lys Glu Asn Ser Thr Glu Ala 
1995 



WO 2004/073657 



210/282 



PCT/US2004/005455 



Glu Tte Leu Ala Val He Thr Ala lle 61y Ser Pro Ilg Agn ^ 

2005 2010 

Pro Leu p he Tyr Hi s He Leu Asn p ro Asp ^ Arg phe Lys ^ 

2020 2025 

ser jjg T hr ser Gly val ^ ^ ^ ^ ^ 

2035 2 04 0 

Arg Glu s Gin Gin Glu Ala Asp Val Val Val Qlu ^ ^ ^ 



2055 



Glu Hi. nys Pro Ser Ala Val Ala His Val Val Val Lys val lie 

2065 2070 

Val jlu Gln Asn Asp Ala ^ ^ ftgn ^ 

2080 2085 

*** ^ HI ASP ^ Glu ™ «y His val xie 

2095 2100 

*» Tyr vax Ttr M , Vil ,.p _ ^ ser 01y arg Aan olu 

2110 2115 
Val His Tyr Tyr Leu Lys Glu His His Glu His Phe Gin He Gly 

2125 2130 

Pro go Gly Glu He Ser Leu Lys Lys Gln Phe Glu Leu Agp Thr 

2140 2145 

Leu js^ Lys Glu ^ Leu Thr ^ ^ ^ ^ ^ ^ ^ 



2155 2160 



Asn p™ Ala Phe Ser Ala Glu Val He Val Pro He Thr Val Met 

2170 2175 

Asn gj Ala Met Pro Val Phe Glu Lys Pro Phe Tyr Ser Ala Glu 

2X55 2190 

He jij Glu ser He Gln Val His Ser Pro Val val His Val Gln 

^ D0 2205 

Ala As^ Ser Pro Glu Gly Leu Lys Val Phe Tyr Ser He Thr Asp 



2220 
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«?5 ^ Thr 116 ASn Phe Asn ^ Oly Val 

2225 2230 2235 

He Asn Val n e Ala Pro Leu Asp Phe Glu Ala His Pro Ala ^ 

2245 2250 

Lys Leu Ser lie Arg Ala Thr Asp Ser Leu Thr Gly Ala Hia Ala 

2260 2265 



Glu vjl Phe val Asp lie lie Val Asp Asp He Asn Asp Asn Pro 

2275 2280 

Pro Val Phe Ala Gin G l n Ser Tyr Ala VaJ Thr Leu ^ ^ ^ 

2290 2295 

Ser Val He Gly Thr Ser Val Val Gln Val Arg Ala Thr Asp Ser 

2305 2 310 

Asp ser Glu Pro Asn Arg Gly u e Ser ^ Gln Mefc phe flly ^ 

2320 2325 

His ser Lys Ser His Asp His Phe His Val Asp Ser Ser Thr Gly 

2335 2340 

Leu Ile_ ser Leu Leu Arg Thr^ Leu Asp Tyr Glu Gln ser Arg Gln 



2355 



His Thr He Phe Val Arg Ala Val Asp Gly Gly Met Pro Thr Leu 

2365 2370 

Ser Ser Asp Val lie Val Thr Val Asp Val Thr Asp Leu Asn Gly 

2380 2385 

Asn Pro Pro Leu Phe Glu Gln Gln He Tyr Glu Ala Arg He Ser 

2395 2400 

Glu His_ Ala Pro His Gly Hi^ Phe Val Thr Cys Val^ Lys Ala Tyr 

ASP ^20 ^ ASP 116 ASP hyS Leu Gln "e Leu 

2425 



2430 



Ser Gly Asn Asp His Lys His Phe Val lie Asp Ser Ala Thr Gly 
^ 435 2440 2445 
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He He Thr Leu Ser Asn Leu His Arg His Ala Leu Lys Pro Phe 
2450 2455 2460 

Tyr Ser Leu Asn Leu Ser Val Ser Asp Gly Val Phe Arg Ser Ser 
2465 2470 2475 

Thr Gin Val His Val Thr Val He Gly Gly Asn Leu His Ser Pro 
24*0 2485 2490 

Ala Phe Leu Gin Asn Glu Tyr Glu Val Glu Leu Ala Glu Asn Ala 
2495 2500 2 505 

Pro Leu His Thr Leu Val Met Glu Val Lys Thr Thr Asp Gly Asp 
2510 2515 " 2520 

Ser Gly He Tyr Gly His Val Thr Tyr His lie Val Asn Asp Phe 
2525 2530 2535 

Ala Lys Asp Arg Phe Tyr He Asn Glu Arg Gly Gin lie Phe Thr 
2540 2545 2550 

Leu Glu Lys Leu Asp Arg Glu Thr Pro Ala Glu Lys Val He Ser 
2555 2560 2565 

Val Arg Leu Met Ala Lys Asp Ala Gly Gly Lys Val Ala Phe Cys 
2570 2575 2 580 

Thr Val Asn Val He Leu Thr Asp Asp Asn Asp Asn Ala Pro Gin 
2585 2590 2 595 

Phe Arg Ala Thr Lys Tyr Glu Val Asn He Gly Ser Ser Ala Ala 
2600 2605 2610 

Lys Gly Thr Ser Val Val Lys Ser Ala Ser Asp Ala Asp Glu Gly 
2615 2620 2625 

Ser Asn Ala Asp He Thr Tyr Ala He Glu Ala Asp Ser Glu Ser 
2630 2635 2 640 

Val Lys Glu Asn Leu Glu He Asn Lys Leu Ser Gly Val He Thr 
2645 2650 2 655 

Thr Lys Glu Ser Leu He Gly Leu Glu Asn Glu Phe Phe Thr Phe 
2660 2665 2670 



Phe Val Arg Ala Val Asp Asn Gly Ser Pro Ser Lys Glu Ser Val 
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Val Leu Val Tyr Val Ly s He Leu Pro Pro Glu Met Gin Leu Pro 
2690 2695 2700 



Lys Phe Ser Glu Pro Phe Tyr Thr Phe Thr Val Ser Glu Asp Val 
2705 2710 2715 



Pro Val Gly Thr Glu lie Asp Leu lie Arg Ala Glu His Ser Gly 
2720 2725 2730 



Thr Val Leu Tyr Ser Leu Val Lys Gly Asn Thr Pro Glu Ser Asn 
2735 2740 2745 



Arg Asp Glu Ser Phe Val He Asp Arg Gin Ser Gly Arg Leu Lys 
2750 2755 2760 



Leu Glu Lys Ser Leu Asp His Glu Thr Thr Lys Trp Tyr Gin Phe 
2765 2770 2775 



Ser He Leu Ala Arg Cys Thr Gin Asp Asp His Glu Met Val Ala 
2780 2785 2790 



Ser Val Asp Val Ser He Gin Val Lys Asp Ala Asn Asp Asn Ser 
2795 2800 2805 



Pro Val Phe Glu Ser Ser Pro Tyr Glu Ala Phe He Val Glu Asn 
2810 2815 2820 



Leu Pro Gly Gly Ser Arg Val He Gin He Arg Ala Ser Asp Ala 
2825 2830 2835 



Asp Ser Gly Thr Asn Gly Gin Val Met Tyr Ser Leu Asp Gin Ser 
2840 2845 2850 



Gin Ser Val Glu Val He Glu Ser Phe Ala lie Asn Met Glu Thr 
2855 2860 2865 



Gly Trp lie Thr Thr Leu Lys Glu Leu Asp His Glu Lys Arg Asp 
2870 2875 2880 



Asn Tyr Gin He Lys Val Val Ala Ser Asp His Gly Glu Lys He 
2885 2890 2895 



Gin Leu Ser Ser Thr Ala He Val Asp Val Thr Val Thr Asp Val 
2900 2905 2910 
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Asn Asp Ser Pro Pro Arg Phe Thr Ala Glu lie Tyr Lys Gly Thr 
2915 2920 2925 



Val Ser Glu Asp Asp Pro Gin Gly Gly Val He Ala lie Leu Ser 
2930 2935 " 2940 



Thr Thr Asp Ala Asp Ser Glu Glu lie Asn Arg Gin Val Thr Tyr 
2945' 2950 2955 



Phe He Thr Gly Gly Asp Pro Leu Gly Gin Phe Ala Val Glu Thr 
2960 2965 2970 



He Gin Asn Glu Trp Lys Val Tyr Val Lys Lys Pro Leu Asp Arg 
2975 2980 2985 



Glu Lys Arg Asp Asn Tyr Leu Leu Thr He Thr Ala Thr Asp Gly 
2990 2995 3000 



Thr Phe Ser Ser Lys Ala He Val Glu Val Lys Val Leu Asp Ala 
• 3005 3010 3015 



Asn Asp Asn Ser Pro Val Cys Glu Lys Thr Leu Tyr Ser Asp Thr 
3020 3025 3030 



He Pro Glu Asp Val Leu Pro Gly Lys Leu He Met Gin He Ser 
3035 3040 . 3045 



Ala Thr Asp Ala Asp He Arg Ser Asn Ala Glu He Thr Tyr Thr 
3050 3055 3060 



Leu Leu Gly Ser Gly Ala Glu Lys Phe Lys Leu Asn Pro Asp Thr 
3065 3070 3075 



Gly Glu Leu Lys Thr Ser Thr Pro Leu Asp Arg Glu Glu Gin Ala 
■ 3080 3085 3090 



Val Tyr His Leu Leu Val Arg Ala Thr Asp Gly Gly Gly Arg Phe 
3095 3100 3105 



Cys Gin Ala Ser He Val Val Thr Leu Glu Asp Val Asn Asp Asn 
3110 3115 " 3120 



Ala Pro Glu Phe Ser Ala Asp Pro Tyr Ala He Thr Val Phe Glu 
3125 3130 3135 
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Asn Thr Glu Pro Gly Thr Leu Leu Thr Arg Val Gin Ala Thr Asp 
3140 3145 3150 



Ala Asp Ala Gly Leu Asn Arg Lys lie Leu Tyr Ser Leu lie Asp 
3155 3160 3165 



Ser Ala Asp Gly Gin Phe Ser He Asn Glu Leu Ser Gly He He 
3170 3175 3180 



Gin Leu Glu Lys Pro Leu Asp Arg Glu Leu Gin Ala Val Tyr Thr 
3185 3190 3195 



Leu Ser Leu Lys Ala Val Asp Gin Gly Leu Pro Arg Arg Leu Thr 
3200 3205 3210 



Ala Thr Gly Thr Val He Val Ser Val Leu Asp lie Asn Asp Asn 
3215 3220 3225 



Pro Pro Val Phe Glu Tyr Arg Glu Tyr Gly Ala Thr Val Ser Glu 
3230 3235 3240 



Asp He Leu Val Gly Thr Glu Val Leu Gin Val Tyr Ala Ala Ser 
3245 3250 3255 



Arg Asp He Glu Ala Asn Ala Glu He Thr Tyr Ser He He Ser 
3260 3265 ^ 3270 



Gly Asn Glu His Gly Lys Phe Ser He Asp Ser Lys Thr Gly Ala 
3275 3280 3285 



Val Phe He lie <31u Asn Leu Asp Tyr Glu Ser Ser His Glu Tyr 
3290 3295 3300 



Tyr Leu Thr Val Glu Ala Thr Asp Gly Gly Thr Pro Ser Leu Ser 
3305 3310 3315 



Asp Val Ala Thr Val Asn Val Asn Val Thr Asp He Asn Asp Asn 
3320 3325 3330 



Thr Pro Val Phe Ser Gin Asp Thr Tyr Thr Thr Val He Ser Glu 
3335 3340 3345 



Asp Ala Val Leu Glu Gin Ser Val He Thr Val Met Ala Asp Asp 
3350 3355 3360 
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Ala Asp Gly Pro Ser Asn Ser His lie His Tyr Ser lie lie Asp 
3365 3370 3375 

Gly Asn Gin Gly Ser Ser Phe Thr He Asp Pro Val Arg Gly Glu 
3380 3385 3390 



Val Lys Val Thr Lys Leu Leu Asp Arg Glu Thr He Ser Gly Tyr 
3395 3400 ~ 3405 



Thr Leu Thr Val Gin Ala Ser Asp Asn Gly Ser Pro Pro Arg Val 
3410 3415 3420 



Asn Thr Thr Thr Val Asn He Asp Val Ser Asp Val Asn Asp Asn 
3425 3430 3435 



Ala Pro Val Phe Ser Arg Gly Asn Tyr Ser Val He He Gin Glu 
' 3440 3445 3450 



Asn Lys Pro Val Gly Phe Ser Val Leu Gin Leu Val Val Thr Asp 
3455 3460 3465 



Glu Asp Ser Ser His Asn Gly Pro Pro Phe Phe Phe Thr He Val 
3470 3475 3480 



Thr Gly Asn Asp Glu Lys Ala Phe Glu Val Asn Pro Gin Gly Val 
3485 3490 3495 



Leu Leu Thr Ser Ser Ala He Lys Arg Lys Glu Lys Asp His Tyr 
3500 3505 3510 



Leu Leu Gin Val Lys Val Ala Asp Asn Gly Lys Pro Gin Leu Ser 
3515 3520 3525 



Ser Leu Thr Tyr lie Asp lie Arg Val He Glu Glu Ser He Tyr 
3530 3535 3540 



Pro Pro Ala He Leu Pro Leu Glu He Phe He Thr Ser Ser Gly 
3545 3550 3555 



Glu Glu Tyr Ser Gly Gly Val lie Gly Lys He His Ala Thr Asp 
3560 3565 3570 



Gin Asp Val Tyr Asp Thr Leu Thr Tyr Ser Leu Asp Pro Gin Met 
3575 3580 3585 



Asp Asn Leu Phe Ser Val Ser Ser Thr Gly Gly Lys Leu lie Ala 
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His Lys Lys Leu Asp He Gly Gin Tyr Leu Leu Asn Val Ser Val 
3^05 3610 3615 



Thr Asp Gly Lys Phe Thr Thr Val Ala Asp lie Thr Val His He 
3620 3625 3630 



Arg Gin Val Thr Gin Glu Met Leu Asn His Thr He Ala He Arg 
3635 3640 3645 



Phe Ala Asn Leu Thr Pro Glu Glu Phe Val Gly Asp Tyr Trp Arg 
3650 3655 3660 



Asn Phe Gin Arg Ala Leu Arg Asn He Leu Gly Val Arg Arg Asn 
3665 3670 3675 



Asp He Gin He Val Ser Leu Gin Ser Ser Glu Pro His Pro His 
3680 3685 3690 



Leu Asp Val Leu Leu Phe Val Glu Lys Pro Gly Ser Ala Gin lie 
3695 3700 3705 



Ser Thr Lys Gin Leu Leu His Lys He Asn Ser Ser Val Thr Asp 
3710 , 3715 3720 



He Glu Glu He He Gly Val Arg lie Leu Asn Val Phe Gin Lys 
3725 3730 3735 



Leu Cys Ala Gly Leu Asp Cys Pro Trp Lys Phe Cys Asp Glu Lys 
3740 3745 3750 



Val Ser Val Asp Glu Ser Val Met Ser Thr His Ser Thr Ala Arg 
3755 3760 3765 



Leu Ser Phe Val Thr Pro Arg His His Arg Ala Ala Val Cys Leu 
3770 3775 3780 



Cys Lys Glu Gly Arg Cys Pro Pro Val His His Gly Cys Glu Asp 
3785 3790 3795 



Asp Pro Cys Pro Glu Gly Ser Glu Cys Val Ser Asp Pro Trp Glu 
3800 3805 3810 



Glu Lys His Thr Cys Val Cys Pro Ser Gly Arg Phe Gly Gin Cys 
3815 3820 3825 
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Pro Gly Ser S e r Ser Met Thr Leu Thr Gly Asn Ser Tyr Val Lys 
3830 3835 " 3840 



Tyr Arg Leu Thr Glu Asn Glu Asn Lys Leu Glu Met Lys Leu Thr 
3845 3850 3855 



Met Arg Leu Arg Thr Tyr Ser Thr His Ala Val Val Met Tyr Ala 
3860 3865 3870 



Arg Gly Thr Asp Tyr Ser lie Leu Glu lie His His Gly Arg Leu 
3875 3880 3885 



Gin Tyr Lys Phe Asp Cys Gly Ser Gly Pro Gly He Val Ser Val 
3890 3895 3900 



Gin Ser lie Gin Val Asn Asp Gly Gin Trp His Ala Val Ala Leu 
3905 3910 S915 



Glu Val Asn Gly Asn Tyr Ala Arg Leu Val Leu Asp Gin Val His 
3920 3925 3930 



Thr Ala Ser Gly Thr Ala Pro Gly Thr Leu Lys Thr Leu Asn Leu 
3935 3940 * 3945 



Asp Asn Tyr Val Phe Phe Gly Gly His He Arg Gin Gin Gly Thr 
3^50 3955 3960 



Arg His Gly Arg Ser Pro Gin Val Gly Asn Gly Phe Arg Gly Cys 
3965 3970 3975 



Het Asp Ser lie Tyr Leu Asn Gly Gin Glu Leu Pro Leu Asn Ser 
3980 3985 3990 



Lys Pro Arg ser Tyr Ala His He Glu Glu Ser Val Asp Val Ser 
3995 4000 4005 



Pro Gly Cys Phe Leu Thr Ala Thr Glu Asp Cys Ala ser Asn Pro 
4010 4015 4020 



Cys Gin Asn Gly Gly Val Cys Asn Pro Ser Pro Ala Gly Gly Tyr 
4025 4030 403S 



Tyr Cys Lys Cys Ser Ala Leu Tyr He Gly Thr His Cys Glu lie 
4040 4045 4050 
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Sex Val Asn Pro Cys Ser Ser Asn Pro Cys Leu Tyr Gly Gly Thr 
4055 4060 4065 



Cys Val Val Asp Asn Gly Gly Phe Val Cys Gin Cys Arg Gly Leu 
4070 4075 4080 



Tyr Thr Gly Gin Arg Cys Gin Leu Ser Pro Tyr Cys Lys Asp Glu 
4085 4090 4095 



pro Cys Lys Asn Gly Gly Thr Cys Phe Asp Ser Leu Asp Gly Ala 
4100 4105 4110 



Val Cys Gin Cys Asp Ser Gly Phe Arg Gly Glu Arg Cys Gin Ser 
4115 4120 4125 



Asp He Asp Glu Cys Ser Gly Asn Pro Cys Leu His Gly Ala Leu 
4130 4135 4140 



Cys Glu Asn Thr His Gly Ser Tyr His Cys Asn Cys Ser His Glu 
4145 4150 4155 



Tyr Arg Gly Arg His Cys Glu Asp Ala Ala Pro Asn Gin Tyr Val 
4160 4165 4170 



Ser Thr Pro Trp Asn He Gly Leu Ala Glu Gly lie Gly lie Val 
4175 4180 4185 



Val Phe Val Ala Gly lie Phe Leu Leu Val. Val Val Phe Val Leu 
4190 4195 4200 



Cys Arg Lys Met He Ser Arg Lys Lys Lys His Gin Ala Glu Pro 
4205 4210 4215 



Lys Asp Lys His Leu Gly Pro Ala Thr Ala Phe Leu Gin Arg Pro 
4220 4225 4230 



Tyr Phe Asp Ser Lys Leu Asn Lys Asn He Tyr Ser Asp He Pro 
4235 4240 4245 



Pro Gin Val Pro Val Arg Pro He ser Tyr Thr Pro Ser He Pro 
4250 4255 4260 

Ser Asp Ser Arg' Asn Asn Leu Asp Arg Asn Ser Phe Glu Gly Ser 
4265 4270 4275 
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Ala He pro Glu His Pro Glu Phe Ser Thr Phe Asn Pro Glu Ser 
4280 4285 4290 



Val His Gly His Arg Lys Ala val Ala Val Cys Ser Val Ala Pro 
4295 4300 4305 



Asn Leu Pro Pro Pro Pro Pro ser Asn Ser Pro Ser Asp Ser Asp 
4310 4315 4320 



Ser lie Gin Lys Pro Ser Trp Asp Phe Asp Tyr Asp Thr Lys Val 
4325 4330 4335 



Val Asp Leu Asp Pro Cys l^eu Ser Lys Lys Pro Leu Glu Glu Lys 
4340 4345 4350 



Pro Ser Gin Pro Tyr Ser Ala Arg Glu Ser Leu Ser Glu Val Gin 
4355 4360 4365 



Ser Leu Ser Ser Phe Gin Ser Glu Ser Cys Asp Asp Asn Gly Tyr 
4370 4395 4380 



His Trp Asp Thr Ser Asp Trp Met Pro Ser Val Pro Leu Pro Asp 
4385 4390 4395 



He Gin Glu Phe Pro Asn Tyr Glu Val He Asp Glu Gin Thr Pro 
4400 4405 " 4410 



Leu Tyr Ser Ala Asp Pro Asn Ala He Asp Thr Asp Tyr Tyr Pro 
4415 4420 4425 



Gly Gly Tyr Asp He Glu Ser Asp Phe Pro Pro Pro Pro Glu Asp 
4430 4435 4440 



Phe Pro Ala Ala Asp Glu Leu Pro Pro Leu Pro Pro Glu Phe Ser 
4445 4450 4455 



Asn Gin Phe Glu Ser He His Pro Pro Arg Asp Met Pro Ala Ala 
4460 4465 4470 



Gly Ser Leu Gly Ser Ser Ser Arg Asn Arg Gin Arg Phe Asn Leu 
4475 4480 4485 



Asn Gin Tyr Leu Pro Asn Phe Tyr Pro Leu Asp Met Ser Glu Pro 
4490 4495 4500 



Gin Thr Lys Gly Thr Gly Glu Asn Ser Thr Cys Arg Glu Pro His 
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4505 4510 4515 



Ala Pro Tyr Pro Pro Gly Tyr Gin Arg His Phe Glu Ala Pro Ala 
4520 4525 4530 



Val Glu Ser Mst Pro Met Ser Val Tyr Ala Ser Thr Ala Ser Cys 
4535 4540 . 4545 



Ser Asp Val Ser Ala Cys Cys Glu Val Glu Ser Glu Val Met Met 
4550 4555 4560 



Ser Asp Tyr Glu Ser Gly Asp Asp Gly His Phe Glu Glu Val Thr 
4565 4570 4575 



lie Pro Pro Leu Asp Ser Gin Gin His Thr Glu Val 
4580 4585 4590 



<210> 94 

<211> 202 

<212> PRT 

<213> Homo Sapiens 

<400> 94 

Met Cys Tyr Gly Lys Cys Ala Arg Cys He Gly His Ser Leu Val Gly 
1 5 10 15 



Leu Ala Leu Leu Cys He Ala Ala Asn He Leu Leu Tyr Phe Pro Asn 
20 25 30 



Gly Glu Thr hy& Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val Trp 
35 40 45 



Phe Phe Ser Gly He Val Gly Gly Gly Leu Leu Met Leu Leu Pro Ala 
50 55 60 



Phe Val Phe lie Gly Leu Glu Gin Asp Asp Cys Cys Gly Cys Cys Gly 
65 70 75 80 



His Glu Asn Cys Gly Lys Arg Cys Ala Met Leu Ser Ser Val Leu Ala 
85 90 95 



Ala Leu He Gly He Ala Gly Ser Gly Tyr Cys Val lie Val Ala Ala 
100 105 HO 



Leu Gly Leu Ala Glu Gly Pro Leu Cys Leu Asp Ser Leu Gly Gin Trp 
115 120 ~ 125 
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Asn Tyr Thr Phe Ma Ser Thr Glu Gly Gin Tyr Leu Leu Asp Thr Ser 
130 135 140 



Thr Trp Ser Glu Cys Thr Glu Pro Lys His lie Val Glu Trp Asn val 
145 150 155 ISO 



Ser Leu Phe Ser lie Leu Leu Ala Leu Gly Gly He Glu Phe lie Leu 
165 170 175 



Cys Leu lie Gin Val He Asn Gly Val Leu Gly Gly He Cys Gly Phe 
180 185 190 



Cys Cys Ser His Gin Gin Gin Tyr Asp Cys 
195 200 



<210> 95 

<211> 1035 

<212> PRT 

<213> Homo Sapiens 

<400> 95 

Met Ser Thr Glu Asn Val Glu Gly Lys Pro Ser Asn Leu Gly Glu Arg 
15 10 15 



Gly Arg Ala Arg Ser Ser Thr Phe Leu Arg Val Val Gin Pro Met Phe 
20 25 3 0 



Asn His Ser He Phe Thr Ser Ala Val Ser Pro Ala Ala Glu Arg He 
35 40 45 



Arg Phe He Leu Gly Glu Glu Asp Asp Ser Pro Ala Pro Pro Gin Leu 
50 55 60 



Phe Thr Glu Leu Asp Glu Leu Leu Ala Val Asp Gly Gin Glu Met Glu 
65 70 75 80 



Trp Lys Glu Thr Ala Arg Trp He Lys Phe Glu Glu Lys Val Glu Gin 
85 90 95 



Gly Gly Glu Arg Trp Ser Lys Pro His Val Ala Thr Leu Ser Leu His 
100 *" 105 110 



Ser Leu Phe Glu Leu Arg Thr Cys Met Glu Lys Gly Ser He Met Leu 
115 120 125 



Asp Arg Glu Ala ser Ser Leu Pro Gin Leu Val Glu Met He Val Asp 
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His Gin He Glu Thr Gly Leu Leu Lys Pro Glu Leu Lys Asp Lys Val 
145 150 155 ' ~ i 6 o 



Thr Tyr Thr Leu Leu Arg Lys His Arg His Gin Thr Lys Lys Ser Asn 
165 170 175 



Leu Arg Ser Leu Ala Asp He Gly Lys Thr Val Ser Ser Ala Ser Arg 
180 165 190 



Wet Phe Thr Asn Pro Asp Asn Gly Ser Pro Ala Met Thr His Arg Asn 
195 200 205 



Leu Thr Ser Ser Ser Leu Asn Asp He Ser Asp Lys Pro Glu Lys Asp 
210 215 220 



Gin Leu Lys Asn Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala 
225 230 235 " 240 



Ser Asn Val Leu Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe He 
245 250 255 



Ala Phe Val Arg Leu Gin Gin Ala Val Met Leu Gly Ala Leu Thr Glu 
260 265 270 



Val Pro Val Pro Thr Arg Phe Leu Phe lie Leu Leu Gly Pro Lys Gly 
275 280 285 



Lys Ala Lys Ser Tyr His Glu He Gly Arg Ala He Ala Thr Leu Met 
290 295 300 



Ser Asp Glu Val Phe His Asp He Ala Tyr Lys Ala Lys Asp Arg His 
305 310 315 " 320 



Asp Leu He Ala Gly He Asp Glu Phe Leu Asp Glu Val He Val Leu 
325 330 * 335 



Pro Pro Gly Glu Trp Asp Pro Ala He Arg He Glu Pro Pro Lys Ser 
340 345 350 



Leu Pro Ser Ser Asp Lys Arg Lys Asn Met Tyr Ser Gly Gly Glu Asn 
355 360 365 



Val Gin Met Asn Gly Asp Thr Pro His Asp Gly Gly His Gly Gly Gly 
370 375 380 
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Gly His Gly Asp Cys Glu Glu Leu Gin Arg Thr Gly Arg Phe Cys Gly 
385 390 395 400 



Gly Leu He Lys Asp lie Lys Arg Lys Ala Pro Phe Phe Ala Ser Asp 
405 410 415 



Phe Tyr Asp Ala Leu Asn lie Gin Ala Leu Ser Ala .He Leu Phe He 
420 425 430 



Tyr Leu Ala Thr Val Thr Asn Ala He Thr Phe Gly Gly Leu Leu Gly 
435 440 445 



Asp Ala Thr Asp Asn Met Gin Gly Val Leu Glu Ser Phe Leu Gly Thr 
450 455 460 



Ala Val Ser Gly Ala He Phe Cys Leu Phe Ala Gly Gin Pro Leu Thr 
465 470 475 480 



lie Leu Ser Ser Thr Gly Pro Val Leu Val Phe Glu Arg Leu Leu Phe 
485 490 495 



Asn Phe Ser Lys Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp 
500 505 510 



lie Gly Leu Trp Ser Ala Phe Leu Cys Leu He Leu Val Ala Thr Asp 
515 520 525 



Ala Ser Phe Leu Val Gin Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe 
530 535 540 



Ser Ser Leu He Ser Phe He Phe He Tyr Asp Ala Phe Lys Lys Met 
545 550 555 560 



He Lys Leu Ala Asp Tyr Tyr Pro He Asn Ser Asn Phe Lys Val Gly 
565 570 575 



Tyr Asn Thr Leu Phe Ser Cys Thr Cys Val Pro Pro Asp Pro Ala Aan 
580 585 590 



He Ser He Ser Asn Asp Thr Thr Leu Ala Pro Glu Tyr Leu Pro Thr 
595 600 605 



Met Ser Ser Thr Asp Met Tyr His Asn Thr Thr Phe Asp Trp Ala Phe 
610 615 620 
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Leu Ser Lys Lys Glu Cys Ser Lys Tyr Gly Gly Asn Leu Val Gly Asn 
625 630 635 640 



Asn Cys Asn Phe Val Pro Asp He Thr Leu Met Ser Phe He Leu Phe 
645 650 655 



Leu Gly Thr Tyr Thr Ser Ser Met Ala Leu Lys Lys Phe Lys Thr Ser 
660 665 670 



Pro Tyr Phe Pro Thr Thr Ala Arg Lys Leu He Ser Asp Phe Ala He 
675 680 685 



He Leu Ser He Leu He Phe Cys Val He Asp Ala Leu Val Gly Val 
690 695 700 



Asp Thr Pro Lys Leu He Val Pro Ser Glu Phe Lys Pro Thr Ser Pro 
705 710 715 720 



Asn Arg Gly Trp Phe Val Pro Pro Phe Gly Glu Asn Pro Trp Trp Val 
725 730 ~ 735 



Cys Leu Ala Ala Ala He Pro Ala Leu Leu Val Thr He Leu He Phe 
740 745 750 



Met Asp Gin Gin He Thr Ala Val He Val Asn Arg Lys Glu His Lys 
755 760 765 



Leu Lys Lys Gly Ala Gly Tyr His Leu Asp Leu Phe Trp Val Ala He 
770 775 780 



Leu Met Val He Cys Ser Leu Met Ala Leu Pro Trp Tyr Val Ala Ala 
785 790 795 * 800 



Thr Val He Ser He Ala His He Asp Ser Leu Lys Met Glu Thr Glu 
805 810 815 



Thr Ser Ala Pro Gly Glu Gin Pro Lys Phe Leu Gly Val Arg Glu Gin 
820 825 830 



Arg Val Thr Gly Thr Leu Val Phe lie Leu Thr Gly Leu Ser Val Phe 
B35 840 845 



Met Ala Pro He Leu Lys Phe He Pro Met Pro Val Leu Tyr Gly Val 
850 855 860 
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Phe Leu Tyr Met Gly Val Ala Ser Leu Asn Gly Val Gin Phe Met Asp 
865 870 875 880 

Arg Leu Lys Leu Leu Leu Met Pro Leu Lys His Gin Pro Asp phe lie 
885 890 895 



Tyr Leu Arg His Val Pro Leu Arg Arg Val His Leu Phe Thr Phe Leu 
900 905 910 



Gin Val Leu Cys Leu Ala Leu Leu Trp lie Leu Lys Ser Thr Val Ala 
BIS 920 ' 925 • 



Ala He He Phe Pro Val Met lie Leu Ala Leu Val Ala Val Arg Lys 
930 935 940 



Gly Met Asp Tyr Leu Phe Ser Gin His Asp Leu Ser Phe Leu Asp Asp 
^45 950 955 960 



Val lie Pro Glu Lys Asp Lys Lys Lys Lys Glu Asp Glu Lys Lys Lys 
965 970 975 



Lys Lys Lys Lys Gly Ser Leu Asp Ser Asp Asn Asp Asp Ser Asp Cys 
980 985 * 990 



Pro Tyr Ser Glu Lys Val Pro Ser lie Lys lie Pro Met Asp He Met 
995 1000 1005 



Glu Gin Gin Pro Phe Leu Ser Asp Ser Lys Pro Ser Asp Arg Glu 
1010 1015 1020 



Arg Ser Pro Thr Phe Leu Glu Arg His Thr Ser Cys 
1025 1030 1035 



*210> 96 

<211> 480 

<212> PRT 

<r213> Homo Sapiens 

<400> 96 

Met Ser Thr Pro Gly Val Asn Ser Ser Ala Ser Leu Ser Pro Asp Arg 
15 10 15 



Leu Asn Ser Pro Val Thr He Pro Ala Val Met Phe He Phe Gly Val 
20 25 30 



Val Gly Asn Leu Val Ala He Val Val Leu Cys Lys Ser Arg Lys Glu 
35 40 45 
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Gin Lys Glu Thr Thr Phe Tyr Thr Leu Val Cys Gly Leu Ala Val Thr 
50 55 60 



Asp Leu Leu Gly Thr Leu Leu Val Ser Pro Val Thr lie Ala Thr Tyr 
65 70 75 80 



Met Lys Gly Gin Trp Pro Gly Gly Gin Pro Leu Cys Glu Tyr Ser Thr 
85 90 95 



Phe lie Leu Leu Phe Phe Ser Leu Ser Gly Leu Ser lie lie Cys Ala 
100 105 110 



Met Ser Val Glu Arg Tyr Leu Ala lie Asn His Ala Tyr Phe Tyr Ser 
115 120 125 



His Tyr Val Asp Lys Arg Leu Ala Gly Leu Thr Leu Phe Ala Val Tyr 
130 135 140 



Ala Ser Asn Val Leu Phe Cys Ala Leu Pro Asn Met Gly Leu Gly Ser 
145 150 155 160 



Ser Arg Leu Gin Tyr Pro Asp Thr Trp Cys Phe He Asp Trp Thr Thr 
165 170 175 



Asn Val Thr Ala His Ala Ala Tyr Ser Tyr Met Tyr Ala Gly Phe Ser 
180 185 190 



Ser Phe Leu lie Leu Ala Thr Val Leu Cys Asn Val Leu Val Cys Gly 
195 200 205 



Ala Leu Leu Arg Met His Arg Gin Phe Met Arg Arg Thr Ser Leu Gly 
210 215 220 



Thr Glu Gin His His Ala Ala Ala Ala Ala Ser Val Ala Ser Arg Gly 
225 230 235 240 



His Pro Ala Ala Ser Pro Ala Leu Pro Arg Leu Ser Asp Pne Arg Arg 
245 250 255 



Rrg Arg Ser Phe Arg Arg He Ala Gly Ala Glu He Gin Met Val He 
260 265 270 



Leu Leu He Ala Thr Ser Leu Val Val Leu He Cys Ser He Pro Leu 
275 280 285 
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Val Val Arg Val Phe Val Asn Gin Leu Tyr Gin Pro Ser Leu Glu Arg 
290 295 300 



Glu Val Ser Lys Asn Pro Asp Leu Gin Ala lie Arg He Ala Ser Val 
305 310 315 320 



Asn Pro lie Leu Asp Pro Trp He Tyr lie Leu Leu Arg Lys Thr Val 
325 330 335 



Leu Ser Lys Ala He Glu Lys He Lys Cys Leu Phe Cys Arg He Gly 
340 345 350 



Gly Ser Arg Arg Glu Arg Ser Gly Gin His Cys Ser Asp Ser Gin Arg 
355 360 365 



Thr Ser Ser Ala Met Ser Gly His Ser Arg Ser Phe He Ser Arg Glu 
370 375 380 



Leu Lys Glu He Ser Ser Thr Ser Gin Thr Leu Leu Pro Asp Leu Ser 
385 390 395 400 



Leu Pro Asp Leu Ser Glu Asn Gly Leu Gly Gly Arg Asn Leu Leu Pro 
405 410 415 



Gly Val Pro Gly Met Gly Leu Ala Gin Glu Asp Thr Thr Ser Leu Arg 
420 425 430 



Thr Leu Arg He Ser Glu Thr Ser Asp Ser Ser Gin Gly Gin Asp Ser 
435 440 445 



Glu Ser Val Leu Leu Val Asp Glu Ala Gly Gly Ser Gly Arg Ala Gly 
450 455 460 



Pro Ala Pro Lys Gly Ser Ser Leu Gin Val Thr Phe Pro Ser Glu Thr 
465 470 475 480 



<210> 97 

<211> 335 

*212> PRT 

<213> Homo Sapiens 

<400> 97 

Met Gly His Pro Pro Leu Leu Pro Leu Leu Leu Leu Leu His Thr Cys 
15 10 15 



Val Pro Ala Ser Trp Gly Leu Arg Cys Met Gin Cys Lys Thr Asn Gly 
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Asp Cys Arg Val Glu Glu Cys Ala Leu Gly Gin Asp Leu Cys Arg Thr 
35 40 45 



Thr lie Val Arg Leu Trp Glu Glu Gly Glu Glu Leu Glu Leu Val Glu 
50 55 * 60 



Lys Ser Cys Thr His Ser Glu Lys Thr Asn Arg Thr Leu Ser Tyr Arg 
65 70 75 80 



Thr Gly Leu Lys lie Thr Ser Leu Thr Glu Val Val Cys Gly Leu Asp 
85 90 95 



Leu Cys Asn Gin Gly Asn Ser Gly Arg Ala Val Thr Tyr Ser Arg Ser 
100 105 110 



Arg Tyr Leu Glu Cys He Ser Cys Gly Ser ser Asp Met Ser Cys Glu 
115 120 125 



Arg Gly Arg His Gin Ser Leu Gin Cys Arg Ser Pro Glu Glu Gin Cys 
130 135 140 



Leu Asp Val Val Thr His Trp He Gin Glu Gly Glu Glu Gly Arg Pro 
145 150 155 160 



Lys Asp Asp Arg His Leu Arg Gly Cys Gly Tyr Leu Pro Gly Cys Pro 
165 170 175 



Gly Ser Asn Gly Phe His Asn Asn Asp Thr Phe His Phe Leu Lys Cys 
180 185 190 



Cys Asn Thr Thr Lys Cys Asn Glu Gly Pro lie Leu Glu Leu Glu Asn 
195 200 205 



Leu Pro Gin Asn Gly Arg Gin Cys Tyr Ser Cys Lys Gly Asn Ser - Thr 
210 215 220 



His Gly Cys Ser Ser Glu Glu Thr Phe Leu He Asp Cys Arg Gly Pro 
225 230 235 240 



Met Asn Gin Cys Leu Val Ala Thr Gly Thr His Glu Pro Lys Asn Gin 
245 250 255 



Ser Tyr Met Val Arg Gly Cys Ala Thr Ala Ser Met Cys Gin His 
260 265 270 



Ala 
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His Leu Gly Asp Ala Phe Ser Met Asn His lie Asp Val Ser Cys Cys 
275 280 285 



Thr Lys Ser Gly Cys Asn His Pro Asp Leu Asp Val Gin Tyr Arg Ser 
290 295 300 



Gly Ala Ala Pro Gin Pro Gly Pro Ala His Leu Ser Leu Thr lie Thr 
305 310 315 320 



Leu Leu Met Thr Ala Arg Leu Trp Gly Gly Thr Leu Leu Trp Thr 
325 330 " 335 



<210> 98 

<211> 512 

<212> PRT 

<213> Homo Sapiens 

<400> 98 

Met Asp Phe Glu Ser Gly Gin Val Asp Pro Leu Ala Ser Val He Leu 
1 5 10 15 



Pro Pro Asn Leu Leu Glu Asn Leu Ser Pro Glu Asp Ser Val Leu Val 
20 25 30 



Arg Arg Ala Gin Phe Thr Phe Phe Asn Lys Thr Gly Leu Phe Gin Asp 
35 40 45 



Val Gly Pro Gin Arg Lys Thr Leu Val Ser Tyr Val Met Ala Cys Ser 
50 55 60 



He Gly Asn lie Thr He Gin Asn Leu Lys Asp Pro Val Gin He Lys 
65 70 75 80 



He Lys His Thr Arg Thr Gin Glu Val His His Pro He Cys Ala Phe 
85 90 95 



Trp Asp Leu Asn Lys Asn Lys Ser Phe Gly Gly Trp Asn Thr Ser Gly 
100 105 * " 110 



Cys Val Ala His Arg Asp Ser Asp Ala Ser Glu Thr Val Cys Leu Cys 
115 120 125 



Asn His Phe Thr His Phe Gly Val Leu Met Asp Leu Pro Arg Ser Ala 
13 0 135 140 
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Ser Gin Leu Asp Ala Arg Asn Thr Lys Val Leu Thr Phe He Ser Tyr 
145 150 155 160 



He Gly Cys Gly He Ser Ala He Phe Ser Ala Ala Thr Leu Leu Thr 
165 170 175 



Tyr Val Ala Phe Glu Lys Leu Arg Arg Asp Tyr Pro Ser Lys He Leu 
180 185 190 



Met Asn Leu Ser Thr Ala Leu Leu Phe Leu Asn Leu Leu Phe Leu Leu 
195 200 205 



Asp Gly Trp He Thr Ser Phe Asn Val Asp Gly Leu Cys lie Ala Val 
210 215 220 



Ala Val Leu Leu His Phe Phe Leu Leu Ala Thr Phe Thr Trp Met Gly 
225 230 235 240 



Leu Glu Ala He His Met Tyr He Ala Leu Val Lys Val Phe Asa Thr 
245 250 255 



Tyr He Arg Arg Tyr He Leu Lys Phe Cys He He Gly Trp Gly Leu 
260 265 270 



Pro Ala Leu Val Val Ser Val Val Leu Ala Ser Arg Asn Asn Asn Glu 
275 280 285 



Val Tyr Gly Lys Glu Ser Tyr Gly Lys Glu Lys Gly Asp Glu Phe Cys 
290 295 300 



Trp He Gin Asp Pro Val He Phe Tyr Val Thr Cys Ala Gly Tyr Phe 
305 310 315 320 



Gly Val Met Phe Phe Leu Asn He Ala Met Phe He Val Val Met Val 
325 330 335 



Gin He Cys Gly Arg Asn Gly Lys Arg Ser Asn Arg Thr Leu Arg Glu 
■340 345 350 



Glu Val Leu Arg Asn Leu Arg Ser Val Val Ser Leu Thr Phe Leu Leu 
355 360 365 



Gly Met Thr Trp Gly Phe Ala Phe Phe Ala Trp Gly Pro Leu Asn lie 
370 375 380 



Pro Phe Met Tyr Leu Phe Ser He Phe Asn Ser Leu Gin Gly Leu Phe 
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lie Phe lie Phe His Cys Ala Met Lys Glu Asn Val Gin Lys Gin Trp 
405 410 415 



Arg Arg His Leu Cys Cys Gly Arg Phe Arg Leu Ala Asp Asn Ser Asp 
420 425 430 



Trp Ser Lys Thr Ala Thr Asn lie He Lys Lys Ser Ser Asp Asn Leu 
435 440 445 



Gly Lys Ser Leu Ser Ser Ser Ser He Gly Ser Asn Ser Thr Tyr Leu 
450 455 460 



Thr Ser Lys Ser Lys Ser Ser Ser Thr Thr Tyr Phe Lys Arg Asn Ser 
465 470 475 480 



His Thr Asp Asn Val Ser Tyr Glu His Ser Phe Asn Lys Ser Gly Ser 
485 490 495 



Leu Arg Gin Cys Phe His Gly Gin Val Leu Val Lys Thr Gly Pro Cys 
500 505 510 



<210> 99 

<211> 202 

<212> PRT 

<213> Homo Sapiens 

<400> 99 

Met Lys Val Leu Ala Ala Gly Val Val Pro Leu Leu Leu Val Leu His 
15 10 15 



Trp Lys His Gly Ala Gly Ser Pro Leu Pro He Thr Pro Val Asn Ala 
20 25 30 



Thr Cys Ala lie Arg His Pro Cys His Asn Asn Leu Met Asn Gin He 
35 40 45 



Arg Ser Gin Leu Ala Gin Leu Asn Gly Ser Ala Asn Ala Leu Phe He 
50 55 60 



Leu Tyr Tyr Thr Ala Gin Gly Glu Pro Phe Pro Asn Asn Leu Asp Lys 
65 70 75 A 80 



Leu Cys Gly Pro Asn Val Thr Asp Phe Pro Pro Phe His Ala Asn Gly 
85 90 95 
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Thr Glu Lys Ala Lys Leu Val Glu Leu Tyr Arg lie Val Val Tyr Leu 
100 105 110 



Gly Thr Ser Leu Gly Asn He Thr Arg Asp Gin Lys He Leu Asn Pro 
115 120 125 



Ser Ala Leu Ser Leu His Ser Lys Leu Asn Ala Thr Ala Asp He Leu 
130 135 140 



Arg Gly Leu Leu Ser Asn Val Leu Cys Arg Leu Cys Ser Lys Tyr His 
145 150 155 160 



Val Gly His Val Asp Val Thr Tyr ' Gly Pro Asp Thr Ser Gly Lys Asp 
165 170 175 



Val Phe Gin Lys Lys Lys Leu Gly Cys Gin Leu Leu Gly Lys Tyr Lys 
180 185 190 



Gin He He Ala Val Leu Ala Gin Ala Phe 

200 





195 


<210> 


100 


<211> 


504 


<212> 


PRT 


<213> 


Homo Sapiens 


<400> 


100 


Met Thr Pro Ser Pro 


1 


5 



10 15 



Gly Ala Phe Pro Pro Ala Ala Ala Ala Arg Gly Pro Pro Lys Met Ala 
20 25 30 



Asp Lys Val Val Pro Arg Gin Val Ala Arg Leu Gly Arg Thr Val Arg 
35 40 45 



Leu Gin Cys Pro Val Glu Gly Asp Pro Pro Pro Leu Thr Met Trp Thr 
50 55 60 



Lys Asp Gly Arg Thr He His Ser Gly Trp Ser Arg Phe Arg Val Leu 
65 70 75 80 



Pro Gin Gly Leu Lys Val Lys Gin Val Glu Arg Glu Asp Ala Gly Val 
85 90 95 



Tyr Val Cys Lys Ala Thr Asn Gly Phe Gly Ser Leu Ser Val Asn Tyr 
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Thr Leu Val Val Leu Asp Asp He Ser Pro Gly Lys Glu Ser Leu Gly 
115 120 125 



Pro Asp Ser Ser Ser Gly Gly Gin Glu Asp Pro Ala Ser Gin Gin Trp 
130 135 140 



Ala Arg Pro Arg Phe Thr Gin Pro Ser Lys Met Arg Arg Arg Val lie 
145 150 155 160 



Ala Arg Pro Val Gly Ser Ser Val Arg Leu Lys Cys Val Ala Ser Gly 
165 170 175 



His Pro Arg Pro Asp He Thr Trp Met Lys Asp Asp Gin Ala Leu Thr 
180 185 190 



Arg Pro Glu Ala Ala Glu Pro Arg Lys Lys Lys Trp Thr Leu Ser Leu 
195 200 205 



Lys Asn Leu Arg Pro Glu Asp Ser Gly Lys Tyr Thr Cys Arg Val Ser 
210 215 220 



Asn Arg Ala Gly Ala He Asn Ala Thr Tyr Lys Val Asp Val He Gin 
225 230 235 240 



Arg Thr Arg Ser Lys Pro Val Leu Thr Gly Thr His Pro Val Asn Thr 
245 250 255 



Thr Val Asp Phe Gly Gly Thr Thr Ser Phe Gin Cys Lys Val Arg Ser 
260 265 270 



Asp Val Lys Pro Val lie Gin Trp Leu Lys Arg Val Glu Tyr Gly Ala 
275 280 285 



Glu Gly Arg His Asn Ser Thr He Asp Val Gly Gly Gin Lys Phe Val 
- 290 295 300 



Val Leu Pro Thr Gly Asp Val Trp Ser Arg Pro Asp Gly Ser Tyr Leu 
305 310 315 320 



Asn Lys Leu Leu He Thr Arg Ala Arg Gin Asp Asp Ala Gly Met Tyr 
325 330 335 



He Cys Leu Gly Ala Asn Thr Met Gly Tyr Ser Phe Arg Ser Ala Phe 
340 345 350 
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Leu Thr Val Leu Pro Asp Pro Lys pro Gin Gly Pro Pro Val Ala Ser 
355 360 365 



Ser Ser Ser Ala Thr Ser Leu Pro Trp Pro Val Val lie Gly lie Pro 
370 375 380 



Ala Gly Ala Val Phe He Leu Gly Thr Leu Leu Leu Trp Leu Cys Gin 
385 390 395 400 



Ala Gin Lys Lys Pro Cys Thr Pro Ala Pro Ala Pro Pro Leu Pro Gly 
405 410 415 



His Arg Pro Pro Gly Thr Ala Arg Asp Arg Ser Gly Asp Lys Asp Leu 
420 425 430 



Pro Ser Leu Ala Ala Leu Ser Ala Gly Pro Gly Val Gly Leu Cys Glu 
435 440 445 



Glu His Gly Ser Pro Ala Ala Pro Gin His Leu Leu Gly Pro Gly Pro 
450 455 460 



Val Ala Gly Pro Lys Leu Tyr Pro Lye Leu ryr Thr Asp He His Thr 
465 470 475 480 



His Thr His Thr His Ser His Thr His Ser His Val Glu Gly Lys Val 
485 490 495 



His Gin His He His Tyr Gin Cys 
500 



<210> 101 

<211> 915 

<212> PRT 

<213> Homo Sapiens 

<400> 101 

Met Gly Arg Pro Arg Leu Thr Leu Val Cys His Val Ser He He He 
1 5 10 15 



Ser Ala Arg Asp Leu Ser Met Asn Asn Leu Thr Glu Leu Gin Pro Gly 
20 25 30 



Leu Phe His His Leu Arg Phe Leu Glu Glu Leu Arg Leu Ser Gly Asn 
35 40 45 
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His Leu Ser His lie Pro Gly Gin Ala Phe Ser Gly Leu Tyr Ser Leu 
50 55 60 



Lys lie Leu Met Leu Gin Asn Asn Gin Leu Gly Gly lie Pro Ala Glu 
65 70 75 80 



Ala Leu Trp Glu Leu Pro Ser Leu Gin Ser Leu Arg Leu Asp Ala Asn 
85 90 95 



Leu lie Ser Leu Val Pro Glu Arg Ser Phe Glu Gly Leu Ser Ser Leu 
100 105 110 



Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu lie Pro Val Arg 
115 120 125 



Ala Leu Asn Asn Leu Pro Ala Leu Gin Ala Met Thr Leu Ala Leu Asn 
130 135 140 



Arg lie Ser His He Pro Asp Tyr Ala Phe Gin Asn Leu Thr Ser Leu 
145 150 155 160 



Val Val Leu His Leu His Asn Asn Arg, He Gin His Leu Gly Thr His 
165 170 175 



Ser Phe Glu Gly Leu His Asn Leu Glu Thr Leu Asp Leu Asn Tyr Asn 
180 185 190 



Lys Leu Gin Glu Phe Pro Val Ala He Arg Thr Leu Gly Arg Leu Gin 
195 200 205 



Glu Leu Gly Phe His Asn Asn Asn He Lys Ala He Pro Glu Lys Ala 
210 215 220 



Phe Met Gly Asn Pro Leu Leu Gin Thr lie His Phe Tyr Asp Asn Pro 
225 230 235 240 



He Gin Phe Val Gly Arg Ser Ala Phe Gin Tyr Leu Pro Lys Leu His 
245 250 255 



Thr Leu Ser Leu Asn Gly Ala Met Asp He Gin Glu Phe Pro Asp Leu 
260 265 270 



Lys Gly Thr Thr Ser Leu Glu He Leu Thr Leu Thr Arg Ala Gly He 
275 280 285 



Arg Leu Leu Pro Ser Gly Met Cys Gin Gin Leu Pro Arg Leu Arg Val 
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Leu Glu Leu Ser His Asn Gin lie Glu Glu Leu Pro Ser Leu His Arg 
305 310 315 320 



Cys Gin Lys Leu Glu Glu lie Gly Leu Gin His Asn Arg lie Trp Glu 
325 330 335 



lie Gly Ala Asp Thr Phe Ser Gin Leu Ser Ser Leu Gin Ala Leu Asp 
340 345 350 



Leu Ser Trp Asn Ala He Arg Ser He His Pro Glu Ala Phe Ser Thr 
355 360 365 



Leu His Ser Leu Val Lys Leu Asp Leu Thr Asp Asn Gin Leu Thr Thr 
370 375 380 



Leu Pro Leu Ala Gly Leu Gly Gly Leu Met His Leu Lys Leu Lys Gly 
385 390 395 400 



Asn Leu Ala Leu Ser Gin Ala Phe Ser Lys Asp Ser Phe Pro Lys Leu 
405 410 415 



Arg He Leu Glu Val Pro Tyr Ala Tyr Gin Cys Cys Pro Tyr Gly Met 
420 425 430 



Cys Ala Ser Phe Phe Lys Ala Ser Gly Gin Trp Glu Ala Glu Asp Leu 
435 440 445 



His Leu Asp Asp Glu Glu Ser Ser Lys Arg Pro Leu Gly Leu Leu Ala 
450 455 460 



Arg Gin Ala Glu Adn His Tyr Asp Gin Asp Leu Asp Glu Leu Gin Leu 
465 470 475 480 



Glu Met Glu Asp Ser Lys Pro His Pro Ser Val Gin Cys Ser Pro Thr 
485 490 495 



Pro Gly Pro Phe Lys Pro Cys Glu Tyr Leu Phe Glu Ser Trp Gly He 
500 505 510 



Arg Leu Ala Val Trp Ala He Val Leu Leu Ser Val Leu Cys Asn Gly 
515 .520 525 



Leu Val Leu Leu Thr Val Phe Ala Gly Gly Pro Val Pro Leu pro Pro 
530 535 540 
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Val Lys Phe Val Val Gly Ala lie Ala Gly Ala Asn Thr Leu Thr Gly 
545 550 555 560 



lie Ser Cys Gly Leu Leu Ala Ser Val Asp Ala Leu Thr Phe Gly Gin 
565 570 575 



Phe Ser Glu Tyr Gly Ala Arg Trp Glu Thr Gly Leu Gly Cys Arg Ala 
580 585 590 



Thr Gly Phe Leu Ala Val Leu Gly Ser Glu Ala Ser Val Leu Leu Leu 
595 600 605 



Thr Leu Ala Ala Val Gin Cys Ser Val Ser Val Ser Cys Val Arg Ala 
610 615 620 



Tyr Gly Lys Ser Pro Ser Leu Gly Ser Val Arg Ala Gly Val Leu Gly 
625 630 635 640 



Cys Leu Ala Leu Ala Gly Leu Ala Ala Ala Leu Pro Leu Ala Ser Val 
645 650 655 



Gly Glu Tyr Gly Ala Ser Pro Leu Cys Leu Pro Tyr Ala Pro Pro Glu 
660 665 670 



Gly Gin Pro Ala Ala Leu Gly Phe Thr Val Ala Leu Val Met Met Asn 
675 680 685 



Ser Phe Cys Phe Leu Val Val Ala Gly Ala Tyr lie Lys Leu Tyr Cys 
690 695 700 



Asp Leu Pro Arg Gly Asp Phe Glu Ala Val Trp Asp Cys Ala Met Val 
705 710 715 720 



Arg His Val Ala Trp Leu lie Phe Ala Asp Gly Leu Leu Tyr Cys Pro 
725 730 735 



Val Ala Phe Leu Ser Phe Ala Ser Met Leu Gly Leu Phe Pro Val Thr 
740 745 750 



Pro Glu Ala Val Lys Ser Val Leu Leu Val Val Leu Pro Leu Pro Ala 
755 760 765 



Cys Leu Asn Pro Leu Leu Tyr Leu Leu Phe Asn Pro His Phe Arg Asp 
770 775 780 
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Asp Leu Arg Arg Leu Arg Pro Arg Ala Gly Asp Ser Gly Pro Leu Ala 
785 790 795 800 



Tyr Ala Ala Ala Gly Glu Leu Glu Lys Ser Ser Cys Asp Ser Thr Gin 
805 810 815 



Ala Leu Val Ala Phe Ser Asp Val Asp Leu lie Leu Glu Ala Ser Glu 
820 825 830 



Ala Gly Arg Pro Pro Gly Leu Glu Thr Tyr Gly Phe Pro Ser Val Thr 
835 840 845 



Leu lie Ser Cys Gin Gin Pro Gly Ala Pro Arg Leu Glu Gly Ser His 
850 855 860 



Cys Val Glu Pro Glu Gly Asn His Phe Gly Asn Pro Gin Pro Ser Met 
865 870 875 880 



Asp Gly Glu Leu Leu Leu Arg Ala Glu Gly Ser Thr Pro Ala Gly Gly 
885 890 895 



Gly Leu Ser Gly Gly Gly Gly Phe Gin Pro Ser Gly Leu Ala Phe Ala 
900 905 910 



Ser His Val 
915 



<210> 


102 


<211> 


647 


<212> 


PRT 


<213> 


Homo Sapiens 


<400> 


102 


Wet Ala Ser Leu Val 


1 


5 



10 15 



Val Val Thr Ala Thr Ala Ser Pro Pro Ala Gly Leu Leu Ser Leu Leu 
20 25 30 



Thr Ser Gly Gin Gly Ala Leu Asp Gin Glu Ala Leu Gly Gly Leu Leu 
35 40 45 



Asn Thr Leu Ala Asp Arg Val His Cys Thr Asn Gly Pro Cys Gly Lys 
50 55 60 



Cys Leu Ser Val Glu Asp Ala Leu Gly Leu Gly Glu Pro Glu Gly Ser 
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65 70 75 80 



Gly Leu Pro Pro Gly Pro Val Leu Glu Ala Arg Tyr Val Ala Arg Leu 
85 90 95 



Ser Ala Ala Ala Val Leu Tyr Leu Ser Asn Pro Glu Gly Thr Cys Glu 
100 105 110 



Asp Thr Arg Ala Gly Leu Trp Ala Ser His Ala Asp His Leu Leu Ala 
115 120 125 



Leu Leu Glu Ser Pro Lys Ala Leu Thr Pro Gly Leu Ser Trp Leu Leu 
130 135 140 



Gin Arg Met Gin Ala Arg Ala Ala Gly Gin Thr Pro Lys Thr Ala Cys 
145 150 155 160 



Val Asp lie Pro Gin Leu Leu Glu Glu Ala Val Gly Ala Gly Ala Pro 
165 170 175 



Gly Ser Ala Gly Gly Val Leu Ala Ala Leu Leu Asp His Val Arg Ser 
180 185 190 



Gly Ser Cys Phe His Ala Leu Pro Ser Pro Gin Tyr Phe Val Asp Phe 
195 200 205 



Val Phe Gin Gin His Ser Ser Glu Val Pro Met Thr Leu Ala Glu Leu 
210 215 220 



Ser Ala Leu Met Gin Arg Leu Gly Val Gly Arg Glu Ala His Ser Asp 
225 230 235 240 



His Ser His Arg His Arg Gly Ala Ser Ser Arg Asp Pro Val Pro Leu 
245 250 255 



He Ser Ser Ser Asn Ser Ser Ser Val Trp Asp Thr Val Cys Leu Ser 
260 265 270 



Ala Arg Asp Val Met Ala Ala Tyr Gly Leu Ser Glu Gin Ala Gly Val 
275 280 285 



Thr Pro Glu Ala Trp Ala Gin Leu Ser Pro Ala Leu Leu Gin Gin Gin 
290 295 300 



Leu Ser Gly Ala Cys Thr Ser Gin Ser Arg Pro Pro Val Gin Asp Gin 
305 310 315 320 
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Leu Ser Gin Ser Glu Arg Tyr Leu Tyr Gly Ser Leu Ala Thr Leu Leu 
325 330 335 



He Cys Leu Cys Ala Val Phe Gly Leu Leu Leu Leu Thr Cys Thr Gly 
340 345 350 



Cys Arg Gly Val Ala His Tyr He Leu Gin Thr Phe Leu Ser Leu Ala 
355 360 365 



Val Gly Ala Leu Thr Gly Asp Ala Val Leu His Leu Thr Pro Lys Val 
370 375 380 



Leu Gly Leu His Thr His Ser Glu Glu Gly Leu Ser Pro Gin Pro Thr 
385 390 395 400 



Trp Arg Leu Leu Ala Met Leu Ala Gly Leu Tyr Ala Phe Phe Leu Phe 
405 410 415 



Glu Asn Leu Phe Asn Leu Leu Leu Pro Arg Asp Pro Glu Asp Leu Glu 
420 425 430 



Asp Gly Pro Cys Gly His Ser Ser His Ser His Gly Gly His Ser His 
435 440 445 



Gly Val Ser Leu Gin Leu Ala Pro Ser Glu Leu Arg Gin Pro Lys Pro 
450 455 460 



Pro His Glu Gly Ser Arg Ala Asp Leu Val Ala Glu Glu Ser Pro Glu 
465 470 475 480 



Leu Leu Asn Pro Glu Pro Arg Arg Leu Ser Pro Glu Leu Arg Leu Leu 
485 490 495 



Pro Tyr Met He Thr Leu Gly Asp Ala Val His Asn Phe Ala Asp Gly 
500 505 510 



Leu Ala Val Gly Ala Ala Phe Ala Ser Ser Trp Lys Thr Gly Leu Ala 
515 " 520 525 



Thr Ser Leu Ala Val Phe Cys His Glu Leu Pro His Glu Leu Gly Asp 
530 535 540 



Phe Ala Ala Leu Leu His Ala Gly Leu Ser Val Arg Gin Ala Leu Leu 
54S 550 555 560 
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Leu Asn Leu Ala Ser Ala Leu Thr Ala Phe Ala Gly Leu Tyr Val Ala 
565 570 " 575 



Leu Ala Val Gly Val Ser Glu Glu Ser Glu Ala Trp He Leu Ala Val 
580 585 590 



Ala Thr Gly Leu Phe Leu Tyr Val Ala Leu Cys Asp Met Leu Pro Ala 
595 600 605 



Met Leu Lys Val Arg Asp Pro Arg Pro Trp Leu Leu Phe Leu Leu His 
610 615 620 



Asn Val Gly Leu Leu Gly Gly Trp Thr Val Leu Leu Leu Leu Ser Leu 
625 630 635 640 



Tyr Glu Asp Asp lie Thr Phe 
645 



<210> 103 

<2ll> 522 

<212> PRT 

<213> Homo Sapiens 

<400> 103 

Met Asp Phe Leu Leu Leu Gly Leu Cys Leu Tyr Trp Leu Leu Arg Arg 
1 5 10 15 



Pro Ser Gly Val Val Leu Cys Leu Leu Gly Ala Cys Phe Gin Met Leu 
20 25 30 



Pro Ala Ala Pro Ser Gly Cys Pro Gin Leu cys Arg Cys Glu Gly Arg 
35 40 45 



Leu Leu Tyr Cys Glu Ala Leu Asn Leu Thr Glu Ala Pro His Asn Leu 
50 55 60 



Ser Gly Leu Leu Gly Leu Ser Leu Arg Tyr Asn Ser Leu Ser Glu Leu 
65 70 75 80 



Arg Ala Gly Gin Phe Thr Gly Leu Met Gin Leu Thr Trp Leu Tyr Leu 
85 90 95 



Asp His Asn His He Cys Ser Val Gin Gly Asp Ala Phe Gin Lys Leu 
100 105 110 



Arg Arg Val Lys Glu Leu Thr Leu Ser Ser Asn Gin He Thr Gin Leu 
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Pro Asn Thr Thr Phe Arg Pro Met pro Asn Leu Arg Ser Val Asp Leu 
130 135 140 



Ser Tyr Asn Lys Leu Gin Ala Leu Ala Pro Asp Leu Phe His Gly Leu 
145 150 155 160 



Arg Lys Leu Thr Thr Leu His Met Arg Ala Asn Ala lie Gin Phe Val 
165 170 175 



Pro Val Arg lie Phe Gin Asp Cys Arg Ser Leu Lys Phe Leu Asp lie 
180 185 190 



Gly Tyr Asn Gin Leu Lys Ser Leu Ala Arg Asn Ser Phe Ala Gly Leu 
195 200 205 



Phe Lys Leu Thr Glu Leu His Leu Glu His Asn Asp Leu Val Lys Val 
210 215 220 



Asn Phe Ala His Phe Pro Arg Leu Xle Ser Leu His Ser Leu Cys Leu 
225 230 235 " 240 



Arg Arg Asn Lys Val Ala lie Val Val Ser Ser Leu Asp Trp Val Trp 
245 250 255 



Asn Leu Glu Lys Met Asp Leu Ser Gly Asn Glu lie Glu Tyr Met Glu 
260 265 270 



Pro His Val Phe Glu Thr Val Pro His Leu Gin Ser Leu Gin Leu Asp 
275 280 285 



Ser Asn Arg Leu Thr Tyr He Glu Pro Arg lie Leu Asn Ser Trp Lys 
290 295 300 



Ser Leu Thr Ser lie Thr Leu Ala Gly Asn Leu Trp Asp Cys Gly Arg 
305 310 315 320 



Asn Val Cys Ala Leu Ala Ser Trp Leu Asn Asn Phe Gin Gly Arg Tyr 
325 330 335 



Asp Gly Asn Leu Gin Cys Ala Ser Pro Glu Tyr Ala Gin Gly Glu Asp 
340 345 350 



Val Leu Asp Ala Val Tyr Ala Phe His Leu Cys Glu Asp Gly Ala Glu 
355 360 365 
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Pro Thr Ser Gly His Leu Leu Ser Ala Val Thr Asn Arg Ser Asp Leu 
370 375 380 



Gly Pro Pro Ala Ser Ser Ala Thr Thr Leu Ala Asp Gly Gly Glu Gly 
3B5 390 395 400 



Gin His Asp Gly Thr Phe Glu Pro Ala Thr Val Ala Leu Pro Gly Gly 
405 410 415 



Glu His Ala Glu Asn Ala Val Gin lie His Lys Val Val Thr Gly Thr 
420 425 430 



Met Ala Leu lie Phe Ser Phe Leu lie Val Val Leu Val Leu Tyr Val 
435 440 445 



Ser Trp Lys Cys Phe Pro Ala Ser Leu Arg Gin Leu Arg Gin Cys Phe 
450 455 460 



Val Thr Gin Arg Arg Lys Gin Lys Gin Lys Gin Thr Met His Gin Met 
465 470 475 480 



Ala Ala Met Ser Ala Gin Glu Tyr Tyr Val Asp Tyr Lys Pro Asn His 
485 490 495 



lie Glu Gly Ala Leu Val Thr lie Asn Glu Tyr Gly Ser Cys Thr Cys 
500 505 " 510 



His Gin Gin Pro Ala Arg Glu Cys Glu Val 

520 





515 
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<212> 


PRT 


<213> 
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<400> 


104 



Met Ala Asn Ala Ser Glu Pro Gly Gly Ser Gly Gly Gly Glu Ala Ala 
1 5 10 15 



Ala Leu Gly Leu Lys Leu Ala Thr Leu Ser Leu Leu Leu Cys Val Ser 
20 25 30 



Leu Ala Gly Asn Val Leu Phe Ala Leu Leu lie Val Arg Glu Arg Ser 
35 40 45 
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Leu His Arg Ala Pro Tyr Tyr Leu Leu Leu Asp Leu Cys Leu Ala Asp 

50 55 60 

Gly Leu Arg Ala Leu Ala Cys Leu Pro Ala Val Met Leu Ala Ala Arg 
65 70 75 80 



Arg Ala Ala Ala Ala Ala Gly Ala Pro Pro Gly Ala Leu Gly Cys Lys 
85 90 95 



Leu Leu Ala Phe Leu Ala Ala Leu Phe Cys Phe His Ala Ala Phe Leu 
100 105 110 



Leu Leu Gly Val Gly Val Thr Arg Tyr Leu Ala He Ala His His Arg 
115 120 125 



Phe Tyr Ala Glu Arg Leu Ala Gly Trp Pro Cys Ala Ala Met Leu Val 
130 135 140 



Cys Ala Ala Trp Ala Leu Ala Leu Ala Ala Ala Phe Pro Pro Val Leu 
145 150 155 160 



Asp Gly Gly Gly Asp Asp Glu Asp Ala Pro cys Ala Leu Glu Gin Arg 
165 170 175 



Pro Asp Gly Ala Pro Gly Ala Leu Gly Phe Leu Leu Leu Leu Ala Val 
180 185 190 



Val Val Gly Ala Thr His Leu Val Tyr Leu Arg Leu Leu Phe Phe lie 
195 200 205 



His Asp Arg Arg Lys Met Arg Pro Ala Arg Leu Val Pro Ala Val Ser 
210' 215 220 



His Asp Trp Thr Phe His Gly Pro Gly Ala Thr Gly Gin Ala Ala Ala 
225 230 235 240 



Asn Trp Thr Ala Gly Phe Gly Arg Gly Pro Thr Pro Pro Ala Leu Val 
245 250 255 



Gly lie Arg Pro Ala Gly Pro Gly Arg Gly Ala Arg Arg Leu Leu Val 
260 265 210 



Leu Glu Glu Phe Lys Thr Glu Lys Arg Leu Cys Lys Met Phe Tyr Ala 
275 280 285 



Val Thr Leu Leu Phe Leu Leu Leu Trp Gly Pro Tyr Val Val Ala Ser 
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Tyr Leu Arg Val Leu Val Arg Pro Gly Ala Val Pro Gin Ala Tyr Leu 
305 310 315 320 



Thr Ala Ser Val Trp Leu Thr Phe Ala Gin Ala Gly He Asn Pro Val 
325 330 335 



Val Cys Phe Leu Phe Asn Arg Glu Leu Arg Asp Cys Phe Arg Ala Gin 
340 345 350 



Phe Pro Cys Cys Gin Ser Pro Arg Thr Thr Gin Ala Thr His Pro Cys 
355 360 365 



Asp Leu Lys Gly He Gly Leu 
370 375 



„<210> 105 

<211> 349 

<212> PRT 

<213> Homo Sapiens 

<400> 105 

Met Asn Arg Lys Ala Arg Arg Cys Leu Gly His Leu Phe Leu Ser Leu 
15 10 15 



Gly Met Val Tyr Leu Arg He Gly Gly Phe Ser Ser Val Val Ala Leu 
20 25 30 



Gly Ala Ser He He Cys Asn Lys He Pro Gly Leu Ala Pro Arg Gin 
35 40 45 



Arg Ala He Cys Gin Ser Arg Pro Asp Ala He He Val He Gly Glu 
50 55 60 



Gly Ser Gin Met Gly Leu Asp Glu Cys Gin Phe Gin Phe Arg Asn Gly 
65 70 75 80 



Arg Trp Asn Cys ser Ala Leu Gly Glu Arg Thr Val Phe Gly Lys Glu 
85 90 95 



Leu Lys Val Gly Ser Arg Glu Ala Ala Phe Thr Tyr Ala He He Ala 
100 105 110 



Ala Gly Val Ala His Ala He Thr Ala Ala Cys Thr Gin Gly Asn Leu 
115 120 125 
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Ser Asp Cys Gly Cys Asp Lys Glu Lys Gin Gly Gin Tyr His Arg Asp 
130 135 140 



Glu Gly Trp Lys Trp Gly Gly Cys Ser Ala Asp lie Arg Tyr Gly He 
145 "* ~ 150 155 160 



Gly Phe Ala Lys Val Phe Val Asp Ala Arg Glu He Lys Gin Asn Ala 
165 170 175 



Arg Thr Leu Met Asn Leu His Asn Asn Glu Ala Gly Arg Lys He Leu 
180 185 190 



Glu Glu Asn Met Lys Leu Glu Cys Lys Cys His Gly Val Ser Gly Ser 
195 200 205 



Cys Thr Thr Lys Thr Cys Trp Thr Thr Leu Pro Gin Phe Arg Glu Leu 
210 215 220 



Gly Tyr Val Leu Lys Asp Lys Tyr Asn Glu Ala Val His Val Glu Pro 
225 230 235 240 



Val Arg Ala Ser Arg Asn Lys Arg Pro Thr Phe Leu Lys He Lys Lys 
245 250 255 



Pro Leu Ser Tyr Arg Lys Pro Met Asp Thr Asp Leu Val Tyr He Glu 
260 265 270 



Lys Ser Pro Asn Tyr Cys Glu Glu Asp Pro Val Thr Gly Ser Val Gly 
275 280 285 



Thr Gin Gly Arg Ala Cys Asn Lys Thr Ala Pro Gin Ala Ser Gly Cys 
290 295 300 



Asp Leu Met Cys Cys Gly Arg Gly Tyr Asn Thr His Gin Tyr Ala Arg 
305 310 315 320 



Val Trp Gin Cys Asn Cys Lys Phe His Trp Cys Cys Tyr Val Lys Cys 
325 330 335 



Asn Thr Cys Ser Glu Arg Thr Glu Met Tyr Thr Cys Lys 
340 345 



<210> 106 

<211> 694 

<212> PRT 

<213> Homo Sapiens 



WO 2004/073657 



248/282 



PCT/US2004/005455 



<400> 106 

Met Glu Trp Gly Tyr Leu Leu Glu Val Thr Ser Leu Leu Ala Ala Leu 
15 10 15 



Ala Leu Leu Gin Arg Sex Ser Gly Ala Ala Ala Ala Ser Ala Lys Glu 
20 25 30 



Leu Ala Cys Gin Glu lie Thr Val Pro Leu Cys Lys Gly He Gly Tyr 
35 40 45 



Asn Tyr Thr Tyr Met Pro Asn Gin Phe Asn His Asp Thr Gin Asp Glu 
50 55 60 



Ala Gly Leu Glu Val His Gin Phe Trp Pro Leu Val Glu He Gin Cys 
65 70 75 80 



Ser Pro Asp Leu Lys Phe Phe Leu Cys Ser Met Tyr Thr Pro He Cys 
85 90 95 



Leu Glu Asp Tyr Lys Lys Pro Leu Pro Pro cys Arg Ser Val Cys Glu 
100 105 110 



Arg Ala Lys Ala Gly Cys Ala Pro Leu Met Arg Gin Tyr Gly Phe Ala 
115 120 125 



Trp Pro Asp Arg Met Arg Cys Asp Arg Leu Pro Glu Gin Gly Asn Pro 
130 135 140 . 



Asp Thr Leu Cys Met Asp Tyr Asn Arg Thr Asp Leu Thr Thr Ala Ala 
145 150 155 160 



Pro Ser Pro Pro Arg Arg Leu Pro Pro Pro Pro Pro Gly Glu Gin Pro 
165 170 175 



Pro Ser Gly Ser Gly His Gly Arg Pro Pro Gly Ala Arg Pro Pro His 
180 185 190 



Arg Gly Gly Gly Arg Gly Gly Gly Gly Gly Asp Ala Ala Ala Pro Pro 
195 200 205 



Ala Arg Gly Gly Gly Gly Gly Gly Lys Ala Arg Pro Pro Gly Gly Gly 
210 215 220 



Ala Ala Pro Cys Glu Pro Gly Cys Gin Cys Arg Ala Pro Met Val Ser 
225 230 235 240 
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Val Ser Ser Glu Arg His Pro Leu Tyr Asn Arg Val Lys Thr Gly Gin 
245 250 255 



He Ala Asn Cya Ala Leu Pro Cys His Asn pro Phe Phe Ser Gin Asp 
260 265 270 



Glu Arg Ala Phe Thr Val Phe Trp lie Gly Leu Trp Ser Val Leu Cys 
275 280 285 



Phe Val Ser Thr Phe Ala Thr Val Ser Thr Phe Leu He Asp Met Glu 
290 295 300 



Arg Phe Lys Tyr Pro Glu Arg Pro He He Phe Leu Ser Ala Cys Tyr 
305 310 315 320 



Leu Phe Val Ser Val Gly Tyr Leu Val Arg Leu Val Ala Gly His Glu 
325 330 335 



Lys Val Ala Cys Ser Gly Gly Ala Pro Gly Ala Gly Gly Ala Gly Gly 
340 345 350 



Ala Gly Gly Ala Ala Ala Gly Ala Gly Ala Ala Gly Ala Gly Ala Gly 
355 360 365 



Gly Pro Gly Gly Arg Gly Glu Tyr Glu Glu Leu Gly Ala Val Glu Gin 
370 375 380 



His Val Arg Tyr Glu Thr Thr Gly Pro Ala' Leu Cys Thr Val Val Phe 
385 - 390 395 400 



Leu Leu Val Tyr Phe Phe Gly Met Ala Ser Ser He Trp Trp Val He 
405 410 415 



Leu Ser . Leu Thr Trp Phe Leu Ala Ala Gly Met Lys Trp Gly Asn Glu 
420 425 430 



Ala He Ala Gly Tyr Ser Gin Tyr Phe His Leu Ala Ala Trp Leu Val 
435 440 445 



Pro Ser Val Lys Ser He Ala Val Leu Ala Leu Ser Ser Val Asp Gly 
450 455 460 



Asp Pro Val Ala Gly He Cys Tyr Val Gly Asn Gin Ser Leu Asp Asn 
465 470 475 480 
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Leu Arg Gly Phe Val Leu Ala Pro Leu Val lie Tyr Leu Phe lie Gly 
485 490 495 



Thr Met Pbe Leu Leu Ala Gly Phe Val Ser Leu Phe Arg lie Arg Ser 
500 505 510 



Val lie Lys Gin Gin Asp Gly Pro Thr Lys Thr His Lys Leu Glu Lys 
515 520 525 



Leu Met lie Arg Leu Gly Leu Phe Thr Val Leu Tyr Thr Val Pro Ala 
530 535 540 



Ala Val Val Val Ala Cys Leu Phe Tyr Glu Gin His Asn Arg Pro Arg 
545 550 555 560 



Trp Glu Ala Thr His Asn Cys Pro Cys Leu Arg Asp Leu Gin Pro Asp 
565 570 ~ 575 



Gin Ala Arg Arg Pro Asp Tyr Ala Val Phe Met Leu Lys Tyr Phe Met 
580 585 590 



Cys Leu Val Val Gly lie Thr Ser Gly Val Trp Val Trp Ser Gly Lys 
595 600 605 



Thr Leu Glu Ser Trp Arg Ser Leu Cys Thr Arg Cys Cys Trp Ala Ser 
610 615 620 



Lys Gly Ala Ala Val Gly Gly Gly Ala Gly Ala Thr Ala Ala Gly Gly 
625 630 635 640 



Gly Gly Gly Pro Gly Gly Gly Gly Gly Gly Gly Pro Gly Gly Gly Gly 
645 650 655 



Gly Pro Gly Gly Gly Gly Gly Ser Leu Tyr Ser Asp Val Ser Thr Gly 
660 665 670 



Leu Thr Trp Arg Ser Gly Thr Ala Ser Ser Val Ser Tyr Pro Lys Gin 
675 680 685 



Met Pro Leu Ser Gin Val 
690 
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<400> 107 

Met Leu Gin Gly Pro Gly Ser Leu Leu Leu Leu Phe Leu Ala Ser His 
15 io 15 



Cys Cys Leu Gly Ser Ma Arg Gly Leu Phe Leu Phe Gly Gin Pro Asp 
20 25 30 



Phe Ser Tyr Lys Arg Ser Asn Cys Lys Pro lie Pro Ala Asn Leu Gin 
35 40 45 



Leu Cys His Gly lie Glu Tyr Gin Asn Met Arg Leu Pro Asn Leu Leu 
50 55 60 



Gly His Glu Thr Met Lys Glu Val Leu Glu Gin Ala Gly Ala Trp lie 
65 70 75 80 



Pro Leu Val Met Lys Gin Cys His Pro Asp Thr Lys Lys Phe Leu Cys 
85 90 95 



Ser Leu Phe Ala Pro Val Cys Leu Asp Asp Leu Asp Glu Thr lie Gin 
100 105 110 



Pro Cys His Ser Leu Cys Val Gin Val Lys Asp Arg Cys Ala Pro Val 
115 120 125 



Met Ser Ala Phe Gly Phe Pro Trp Pro Asp Met Leu Glu Cys Asp Arg 
130 135 140 



Phe Pro Gin Asp Asn Asp Leu Cys He Pro Leu Ala Ser Ser Asp His 
145 150 155 160 



Leu Leu Pro Ala Thr Glu Glu Ala Pro Lys Val Cys Glu Ala Cys Lys 
165 170 175 



Asn Lys Asn Asp Asp Asp Asn Asp lie Met Glu Thr Leu Cys Lys Asn 
180 185 190 



Asp Phe Ala Leu Lys He Lys Val Lys Glu He Thr Tyr He Asn Arg 
195 200 205 



Asp Thr Lys lie He Leu Glu Thr Lys Ser Lys Thr He Tyr Lys Leu 
210 215 220 



Asn Gly Val Ser Glu Arg Asp Leu Lys Lys Ser Val Leu Trp Leu Lys 
225 230 235 * 240 
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Asp Ser Leu Gin Cys Thr Cys Glu Glu Met Asn Asp He Asn Ala Pro 
245 250 255 



Tyr Leu Val Met Gly Gin Lys Gin Gly Gly Glu Leu Val He Thr Ser 
260 265 270 



Val Lys Arg Trp Gin Lys Gly Gin Arg Glu Phe Lys Arg He Ser Arg 
275 280 285 



Ser He Arg Lys Leu Gin Cys 
290 295 



<:210> 108 

<211> 328 
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<400> 108 

Met Gly Phe Trp He Leu Ala He Leu Thr He Leu Met Tyr Ser Thr 
1 " 5 10 15 



Ala Ala Lys Phe Ser Lys Gin Ser Trp Gly Leu Glu Asn Glu Ala Leu 
20 25 30 



He Val Arg Cys Pro Arg Gin Gly Lys Pro Ser Tyr Thr Val Asp Trp 
35 40 45 



Tyr Tyr Ser Gin Thr Asn Lys Ser He Pro Thr Gin Glu Arg Asn Arg 
50 55 60 



Val Phe Ala Ser Gly Gin Leu Leu Lys Phe Leu Pro Ala Ala Val Ala 
65 70 75 80 



Asp Ser Gly He Tyr Thr Cys He Val Arg Ser Pro Thr Phe Asn Arg 
85 90 95 



Thr Gly Tyr Ala Asn Val Thr He Tyr Lys Lys Gin Ser Asp Cys Asn 
100 105 110 



Val Pro Asp Tyr Leu Met Tyr Ser Thr Val Ser Gly Ser Glu hys Asn 
115 120 125 



Ser Lys He Tyr Cys Pro Thr He Asp Leu Tyr Asn Trp Thr Ala Pro 
130 135 140 
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Leu Glu Trp Phe Lys Asn Cys Gin Ala Leu Gin Gly Ser Arg Tyr Arg 
145 150 155 160 

Ala His Lys Ser Phe Leu Val He Asp Asn Val Met Thr Glu Asp Ala 
165 170 175 

Gly Asp Tyr Thr Cys Lys Phe He His Asn Glu Asn Gly Ala Asn Tyr 
180 185 190 

Ser Val Thr Ala Thr Arg Ser Phe Thr Val Lys Asp Glu Gin Gly Phe 
195 200 205 

Ser Leu Phe Pro Val He Gly Ala Pro Ala Gin Asn Glu He Lys Glu 
210 215 220 

Val Glu He Gly Lys Asn Ala Asn Leu Thr Cys Ser Ala Cys Phe Gly 
225 " 230 235 240 

Lys Gly Thr Gin Phe Leu Ala Ala Val Leu Trp Gin Leu Asn Gly Thr 
245 250 255 

Lys He Thr Asp Phe Gly Glu Pro Arg He Gin Gin Glu Glu Gly Gin 
260 265 270 



Asn Gin Ser Phe Ser Asn Gly Leu Ala Cys Leu Asp Met Val Leu Arg 
275 280 285 



lie Ala Asp Val Lys Glu Glu Asp Leu Leu Leu Gin Tyr Asp Cys Leu 
290 295 300 



Ala Leu Asn Leu His Gly Leu Arg Arg His Thr Val Arg Leu Ser Arg 
305 310 315 320 



Lys Asn Pro Ser Lys Glu Cys Phe 
325 



<210> 109 

<211> 89 

<212> PRT 

<213> Homo Sapiens 

<400?> 109 

Met Lys Gly Leu Ala Ala Ala Leu Leu Val Leu Val Cys Thr Met Ala 
15 10 15 



Leu Cys Ser Cys Ala Gin Val Gly Thr Asn Lys Glu Leu Cys Cys Leu 
20 25 30 
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Val Tyr Thr Ser Trp Gin lie Pro Gin Lys Phe He Val Asp Tyr Ser 
35 40 ^ 45 



Glu Thr Ser Pro Gin Cys Pro Lys pro Gly Val He Leu Leu Thr Lys 
50 55 60 



Arg Gly Arg Gin He Cys Ala Asp Pro Asn Lys Lys Trp Val Gin Lys 
65 70 75 80 



Tyr He Ser Asp Leu Lys Leu Asn Ala 
85 



<210> 110 

<211> 540 

<212> PRT 

<213> Homo Sapiens 

<400> 110 

Met Ala Thr Ala Pro Gly Pro Ala Gly He Ala Met Gly Ser Val Gly 
15 10 15 



Ser Leu Leu Glu Arg Gin Asp Phe Ser Pro Glu Glu Leu Arg Ala Ala 
20 25 30 



Leu Ala Gly Ser Arg Gly Ser Arg Gin Pro Asp Gly Leu Leu Arg Lys 
35 40 45 



Gly Leu Gly Gin Arg Glu Phe Leu Ser Tyr Leu His Leu Pro Lys Lys 
50 55 60 



Asp Ser Lys Ser Thr Lys Asn Thr Lys Arg Ala Pro Arg Asn Glu Pro 
65 70 75 80 



Ala Asp Tyr Ala Thr Leu Tyr Tyr Arg Glu His Ser Arg Ala Gly Asp 
85 90 95 



Phe Ser Lys Thr Ser Leu Pro Glu Arg Gly Arg Phe Asp Lys Cys Arg 
100 105 110 



lie Arg Pro Ser Val Phe Lys Pro Thr Ala Gly Asn Gly hys Gly Phe 
115 120 125 



Leu Ser Met Gin Ser Leu Ala Ser His Lys Gly Gin Lys Leu Trp Arg 
130 135 140 
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Ser Asn Gly Ser Leu His Thr Leu Ala cys His Pro Pro Leu Ser Pro 
145 150 155 160 



Gly Pro Arg Ala Ser Gin Ala Arg Ala Gin Leu Leu His Ala Leu Ser 
165 170 175 



Leu Asp Glu Gly Gly Pro Glu Pro Glu Pro Ser Leu Ser Asp Ser Ser 
180 185 190 



Ser Gly Gly Ser Phe Gly Arg Ser Pro Gly Thr Gly Pro Ser Pro Phe 
195 200 ' 205 



Ser Ser Ser Leu Gly His Leu Asn His Leu Gly Gly Ser Leu Asp Arg 
210 215 220 



Ala Ser Gin Gly Pro Lys Glu Ala Gly Pro Pro Ala Val Leu Ser Cys 
225 230 235 240 



Leu Pro Glu Pro Pro Pro Pro Tyr Glu Phe Ser Cys Ser Ser Ala Glu 
245 250 255 



Glu Met Gly Ala Val Leu Pro Glu Thr Cys Glu Glu Leu Lys Arg Gly 
260 265 270 



Leu Gly Asp Glu Asp Gly Ser Asn Pro Phe Thr Gin Val Leu Glu Glu 
275 280 285 



Arg Gin Arg Leu Trp Leu Ala Glu Leu Lys Arg Leu Tyr Val Glu Arg 
290 295 300 



Leu His Glu Val Thr Gin Lys Ala Glu Arg Ser Glu Arg Asn Leu Gin 
305 310 315 320 



Leu Gin Leu Phe Met Ala Gin Gin Glu Gin Arg Arg Leu. Arg Lys Glu 
325 330 335 



Leu Arg Ala Gin Gin Gly Leu Ala Pro Glu Pro Arg Ala Pro Gly Thr 
340 345 350 



Leu Pro Glu Ala Asp Pro Ser Ala Arg Pro Glu Glu Glu Ala Arg Trp 
355 360 365 



Glu Val Cys Gin Lys Thr Ala Glu lie Ser Leu Leu Lys Gin Gin Leu 
370 375 3 80 



Arg Glu Ala Gin Ala Glu Leu Ala Gin Lys Leu Ala Glu lie Phe Ser 



WO 2004/073657 
385 390 



256/282 

395 



PCT/US2004/005455 
400 



Leu Lys Thr Gin Leu Arg Gly Ser Arg Ala Gin Ala Gin Ala Gin Asp 
405 410 415 



Ala Glu Leu Val Arg Leu Arg Glu Ala Val Arg Ser Leu Gin. Glu Gin 
420 425 430 



Ala Pro Arg Glu Glu Ala Pro Gly Ser Cys Glu Thr Asp Asp Cys Lys 
435 440 445 



Ser Arg Gly Leu Leu Gly Glu Ala Gly Gly Ser Glu Ala Arg Asp Ser 
450 ' 455 460 



Ala Glu Gin Leu Arg Ala Glu Leu Leu Gin Glu Arg Leu Arg Gly Gin 
465 470 475 480 



Glu Gin Ala Leu Arg Phe Glu Gin Glu Arg Arg Thr Trp Gin Glu Glu 
485 490 495 



Lys Glu Arg Val Leu Arg Tyr Gin Arg Glu lie Gin Gly Gly Tyr Met 
500 505 510 



Asp Met Tyr Arg Arg Asn Gin Ala Leu Glu Gin Glu Leu Arg Ala Leu 
515 520 525 



Arg Glu Pro Pro Thr Pro Trp Ser Pro Arg Leu Glu 
530 535 540 



<210> 111 

<211> 673 

<212> PRT 

<213> Homo Sapiens 

<400> 111 

Met Pro Gly Gin Lys Phe Phe Leu Glu Val Leu Cys Cys Pro Ser Lys 
15 10 15 



Asn Trp Arg Ser Ser Ala Ala Glu Arg Val Pro Pro Ser Pro lie Arg 
20 25 30 



Leu Arg Arg Arg Arg Pro Pro Ala Phe Ser Arg Arg Leu Pro Leu Arg 
35 ^40 '45 



Arg Ser Asp Pro Ala Arg Ser Pro Gly Pro Ser Arg Arg Leu Ala Gly 
50 55 60 



WO 2U04/073657 
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Gly Phe Lys Ser Ala Arg Gly Ser Cys Asp Ala Gin Gly Leu Arg Ser 
65 70 75 80 



Arg Gly Pro Ala Ser Ala Ser Pro Pro Trp Ala Ala Val Ser Ser He 
85 90 95 



Ser Thr Lys Asp Trp Ser. Glu Ser Asn Ser Ser Pro Cys Ser Glu He 
100 105 110 



Pro Val Leu Pro Ala Asn Leu Gly Asp Trp Arg Gly He Trp Trp Gly 
115 120 125 



Thr Trp Gin Glu Ala Pro Gly Pro Ala Gly He Ala Met Gly Ser Val 
130 135 140 



Gly Ser Leu Leu Glu Arg Gin Asp Phe Ser Pro Glu Glu Leu Arg Ala 
145 150 155 160 



Ala Leu Ala Gly Ser Arg Gly Ser Arg Gin Pro Asp Gly Leu Leu Arg 
165 170 175 



Lys Gly Leu Gly Gin Arg Glu Phe Leu Ser Tyr Leu His Leu Pro Lys 
160 185 190 



Lys Asp Ser Lys Ser Thr Lys Asn Thr Lys Arg Ala Pro Arg Asn Glu 
195 200 205 



Pro Ala Asp Tyr Ala Thr Leu Tyr Tyr Arg Glu His Ser Arg Ala Gly 
210 215 220 



Asp Phe Ser Lys Thr Ser Leu Pro Glu Arg Gly Arg Phe Asp Lys Cys 
225 230 235 240 



Arg He Arg Pro Ser Val Phe Lys Pro Thr Ala Gly Asn Gly Lys Gly 
245 250 255 



Phe Leu Ser Met Gin Ser Leu Ala Ser His Lys Gly Gin Lys Leu Trp 
260 265 270 



Arg Ser Asn Gly Ser Leu His Thr Leu Ala Cys His Pro Pro Leu Ser 
275 2B0 285 



Pro Gly Pro Arg Ala Ser Gin Ala Arg Ala Gin Leu Leu His Ala 
290 295 300 



Leu 
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Ser Leu Asp Glu Gly Gly Pro Glu Pro Glu Pro Ser Leu ser Asp Ser 

305 310 315 320 



Ser Ser Gly Gly Ser Phe Gly Arg Ser Pro Gly Thr Gly Pro Ser Pro 
325 330 335 



Phe Ser Ser Ser Leu Gly His Leu Asn His Leu Gly Gly ser Leu Asp 
340 345 350 



Arg Ala Ser Gin Gly Pro Lys Glu Ala Gly Pro Pro Ala Val Leu Ser 
355 360 365 



Cys Leu Pro Glu Pro Pro Pro Pro Tyr Glu Phe Ser Cys Ser Ser Ala 
370 375 380 



Glu Glu Met Gly Ala Val Leu Pro Glu Thr Cys Glu Glu Leu Lys Arg 
385 390 395 400 



Gly Leu Gly Asp Glu Asp Gly Ser Asn Pro Phe Thr Gin Val Leu Glu 
405 410 415 



Glu Arg Gin Arg Leu Trp Leu Ala Glu Leu Lys Arg Leu Tyr Val Glu 
420 425 430 



Arg Leu His Glu Val Thr Gin Lys Ala Glu Arg Ser Glu Arg Asn Leu 
435 440 ~ 445 



Gin Leu Gin Leu Phe Met Ala Gin Gin Glu Gin Arg Arg Leu Arg Lys 
450 455 460 



Glu Leu Arg Ala Gin Gin Gly Leu Ala Pro Glu Pro Arg Ala Pro Gly 
465 470 475 " 480 



Thr Leu Pro Glu Ala Asp Pro Ser Ala Arg Pro Glu Glu Glu Ala Arg 
485 490 495 



Trp Glu Val Cys Gin Lys Thr Ala Glu lie Ser Leu Leu Lys Gin Gin 
500 505 510 



Leu Arg Glu Ala Gin Ala Glu Leu Ala Gin Lys Leu Ala Glu lie Phe 
515 520 525 



Ser Leu Lys Thr Gin Leu Arg Gly Ser Arg Ala Gin Ala Gin Ala Gin 
530 535 540 



Asp Ala Glu Leu Val Arg Leu Arg Glu Ala Val Arg Ser Leu Gin Glu 
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545 550 555 560 



Gin Ala Pro Arg Glu Glu Ala Pro Gly Ser Cys Glu Thr Asp Asp Cys 
565 570 575 



I»ys Ser Arg Gly Leu Leu Gly GJu Ala Gly Gly Ser Glu Ala Arg Asp 
580 585 590 



Ser Ala Glu Gin Leu Arg Ala Glu Leu Leu Gin Glu Arg Leu Arg Gly 
595 600 605 



Gin Glu Gin Ala Leu Arg Phe Glu Gin Glu Arg Arg Thr Trp Gin Glu 
610 615 620 



Glu Lys Glu Arg Val Leu Arg Tyr Gin Arg Glu He Gin Gly Gly Tyr 
625 630 635 640 



Met Asp Met Tyr Arg Arg Asn-Gln Ala Leu Glu Gin Glu Leu Arg Ala 
645 650 655 



Leu Arg Glu Pro Pro Thr Pro Trp Ser Pro Arg Leu Glu Ser Ser Lys 
660 665 670 



He 



<210> 112 

<211> 998 

<212> PRT 

<213> Homo Sapiens 

<400> 112 

Met Ala Arg Ala Arg Pro Pro Pro Pro Pro Ser Pro Pro Pro Gly Leu 
15 10 15 



Leu Pro Leu Leu Pro Pro Leu Leu Leu Leu Pro Leu Leu Leu Leu Pro 
20 25 30 



Ala Gly Cys Arg Ala Leu Glu Glu Thr Leu Met Asp Thr Lys Trp Val 
35 40 ~ 45 



Thr Ser Glu Leu Ala Trp Thr Ser His Pro Glu Ser Gly Trp Glu Glu 
50 55 60 



Val Ser Gly Tyr Asp Glu Ala Met Asn Pro He Arg Thr Tyr Gin Val 
65 70 75 80 



WO 2004/073657 
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Cys Asn Val Arg Glu Ser Ser Gin Asn Asn Trp Leu Arg Thr Gly Phe 
85 90 95 



He Trp Arg Arg Asp Val Gin Arg val Tyr Val Glu Leu Lys Phe Thr 
100 105 110 



Val Arg Asp Cys Asn Ser He Pro Asn He Pro Gly Ser Cys Lys Glu 
115 120 125 



Thr Phe Asn Leu Phe Tyr Tyr Glu Ala Asp Ser Asp Val Ala Ser Ala 
130 135 140 



Ser Ser Pro Phe Trp Met Glu Asn Pro Tyr Val Lys Val Asp Thr He 
145 150 155 160 



Ala Pro Asp Glu Ser Phe Ser Arg Leu Asp Ala Gly Arg Val Asn Thr 
165 170 175 



Lys Val Arg Ser Phe Gly Pro Leu Ser Lys Ala Gly Phe Tyr Leu Ala 
180 185 190 



Phe Gin Asp Gin Gly Ala Cys Met Ser Leu He Ser Val Arg Ala Phe 
195 200 205 



Tyr Lys Lys Cys Ala Ser Thr Thr Ala Gly Phe Ala Leu Phe Pro Glu 
210 215 220 



Thr Leu Thr Gly Ala Glu Pro Thr Ser Leu Val He Ala Pro Gly Thr 
225 230 235 240 



Cys He Pro Asn Ala Val Glu Val Ser Val Pro Leu Lys Leu Tyr Cys 
245 250 255 



Asn Gly Asp Gly Glu Trp Met Val Pro Val Gly Ala Cys Thr Cys Ala 
260 265 270 



Thr Gly His Glu Pro Ala Ala Lys Glu Ser Gin Cys Arg Pro Cys Pro 
275 280 285 



Pro Gly Ser Tyr Lys Ala Lys Gin Gly Glu Gly Pro Cys Leu Pro Cys 
290 295 300 



Pro Pro Asn Ser Arg Thr Thr Ser Pro Ala Ala Ser He Cys Thr Cys 
305 310 315 320 
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His Asn Asn Phe Tyr Arg Ala Asp Ser Asp Ser Ala Asp Ser Ala Cys 
325 330 335 



Thr Thr Val Pro Ser Pro Pro Arg Gly Val He Ser Asn Val Asn Glu 
340 345 350 



Thr Ser Levi lis Leu Glu Trp Ser Glu Pro Arg Asp Leu Gly Gly Arg 
355 360 365 



Asp Asp Leu Leu Tyr Asn Val lie Cys Lys Lys Cys His Gly Ala Gly 
370 375 380 



Gly Ala Ser Ala Cys Ser Arg Cys Asp Asp Asn Val Glu Phe Val Pro 
385 390 395 400 



Arg Gin Leu Gly Leu Thr Glu Arg Arg Val His He Ser His Leu Leu 
405 410 415 



Ala His Thr Arg Tyr Thr Phe Glu Val Gin Ala Val Asn Gly Val Ser 
420 425 430 



Gly Lys Ser Pro Leu Pro Pro Arg Tyr Ala Ala Val Asn He Thr Thr 
435 440 445 



Asn Gin Ala Ala Pro Ser Glu Val Pro Thr Leu Arg Leu His Ser Ser 
450 455 460 



Ser Gly Ser Ser Leu Thr Leu Ser Trp Ala Pro Pro Glu Arg Pro Asn 
465 470 475 480 



Gly Val He Leu Asp Tyr Glu Met Lys Tyr Phe Glu Lys Ser Glu Gly 
485 490 495 



He Ala Ser Thr Val Thr Ser Gin Met Asn Ser Val Gin Leu Asp Gly 
500 505 510 



Leu Arg Pro Asp Ala Arg Tyr Val Val Gin Val Arg Ala Arg Thr Val 
515 520 525 



Ala Gly Tyr Gly Gin Tyr Ser Arg Pro Ala Glu Phe Glu Thr Thr Ser 
530 535 540 



Glu Arg Gly Ser Gly Ala Gin Gin Leu Gin Glu Gin Leu Pro Leu He 
545 550 555 560 



Val Gly Ser Ala Thr Ala Gly Leu Val Phe Val Val Ala Val Val Val 



WO 2004/073657 

565 



262/282 
570 
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575 



He Ala He Val Cys Leu Arg Lys Gin Arg His Gly Ser Asp Ser Glu 
580 585 590 



Tyr Thr Glu Lys Leu Gin Gin Tyr He Ala Pro Gly Met Lys Val Tyr 
595 600 605 



He Asp Pro Phe Thr Tyr Glu Asp Pro Asn Glu Ala Val Arg Glu Phe 
610 615 620 

Ala Lys Glu He Asp Val Ser Cys Val Lys He Glu Glu Val He Gly 
625 630 635 640 



Ala Gly Glu Phe Gly Glu Val Cys Arg Gly Arg Leu Lys Gin Pro Gly 
645 650 655 



Arg Arg Glu Val Phe Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr 
660 665 670 



Glu Arg Gin Arg Arg Asp Phe Leu Ser Glu Ala Ser He Met Gly Gin 
675 680 685 



Phe Asp His Pro Asn He He Arg Leu Glu Gly Val Val Thr Lys Ser 
690 695 700 



Arg Pro Val Met He Leu Thr Glu Phe Met Glu Asn Cys Ala Leu Asp 
705 710 715 720 

Ser Phe Leu Arg Leu Asn Asp Gly Gin Phe Thr Val He Gin Leu Val 
725 730 735 



Gly Met Leu Arg Gly He Ala Ala Gly Met Lys Tyr Leu Ser Glu Met 
740 745 750 



Asn Tyr Val His Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser 
755 760 765 



Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser Arg Phe Leu Glu 
770 775 780 



Asp Asp Pro Ser Asp Pro Thr Tyr Thr Ser Ser Leu Gly Gly Lys He 
785 790 795 800 



Pro He Arg Trp Thr Ala Pro Glu Ala He Ala Tyr Arg Lys Phe Thr 
805 810 815 
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S^r Ala Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu Val Met 
820 825 



Ser Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val lie 
&35 840 B45 



Asn Ala Val Glu Gin Asp Tyr Arg Leu Pro Pro Pro Met Asp Cys Pro 
850 855 860 



Thr Ala Leu His Gin Leu Met Leu Asp Cys Trp Val Arg Asp Arg Asn 
865 870 875 880 



Leu Arg Pro Lys Phe Ser Gin He Val Asn Thr Leu Asp Lys Leu He 
8 85 890 895 



Arg Asn Ala Ala Ser Leu Lys Val He Ala Ser Ala Gin Ser Gly Met 
900 905 910 



Ser Gin Pro Leu Leu Asp Arg Thr Val Pro Asp Tyr Thr Thr Phe Thr 
915 920 925 



Thr Val Gly Asp Trp Leu Asp Ala He Lys Met Gly Arg Tyr Lys Glu 
930 935 940 



Ser Phe Val Ser Ala Gly Phe Ala Ser Phe Asp Leu Val Ala Gin Met 
945 950 955 960 



Thr Ala Glu Asp Leu Leu Arg He Gly Val Thr Leu Ala Gly His Gin 
965 970 975 



Lys Lys He Leu Ser Ser He Gin Asp Met Arg Leu Gin Met Asn Gin 
980 985 990 



Thr Leu Pro Val Gin Val 





995 


<210> 


113 


<211> 


413 


*212> 


PRT 


*213> 


Homo Sapiens 


<400> 


113 



Met Gly Gly Thr Thr Leu Ala Trp Ser Met Ala Arg Asp Ser Ala Gly 
1 5 10 15 
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Leu Val Ala Gly Asn Leu Asp Leu Ser Glu Lys His Asp Pro Arg Pro 
20 25 30 



Pro Pro Leu Leu His Pro Pro Gly Pro Thr Ala Val Leu Ala Gly Asp 
35 40 45 



Gly Ser Phe Arg Lys Cys Ala Glu Lys Ser Thr Phe Pro Cys Gin Ala 
50 55 60 



Thr Ala Arg Glu Leu Thr Pro Leu Phe Glu Pro Cys Gin Pro Pro His 
65 70 75 80 



Leu Val Gly Arg Val Lys Gly Arg Glu Val Asn Thr Ala Pro Thr Pro 
85 90 95 



Leu Pro Cys Arg Pro Ser Gly Arg Pro Val Ala Gly Gly Gly Gly Asp 
100 * 105 110 



Gly Pro Gly Gly Pro Glu Pro Gly Trp Val Asp Pro Arg Thr Trp Leu 
115 ^ 120 125 



Ser Phe Gin Gly Pro Pro Gly Gly Pro Gly lie Gly Pro Gly Val Gly 
130 135 140 



Pro Gly Ser Glu Val Trp Gly He Pro Pro Cys Pro Pro Pro Tyr Glu 
145 150 155 160 



Phe Cys Gly Gly Met Ala Tyr Cys Gly Pro Gin Val Gly Val Gly Leu 
165 170 175 



Val Pro Gin Gly Gly Leu Glu Thr Ser Gin Pro Glu Gly Glu Ala Gly 
180 185 190 



Val Gly Val Glu Ser Asn Ser Asp Gly Ala Ser Pro Glu Pro Cys Thr 
195 200 205 



Val Thr Pro Gly Ala Val Lys Leu Glu Lys Glu Lys Leu Glu Gin Asn 
210 215 220 



Pro Glu Glu Ala Arg Lys Val Phe Ser Gin Thr Thr lie Cys Arg Phe 
225 230 235 240 



Glu Ala Leu Gin Leu Ser Phe Lys Asn Met Cys Lys Leu Arg Pro Leu 
245 250 255 



Leu Gin Lys Trp Val Glu Glu Ala Asp Asn Asn Glu Asn Leu Gin Glu 
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260 265 270 



lie Cys Lys Ala Glu Thr Leu Val Gin Ala Arg Lys Arg Lys Arg Thr 
275 280 285 



Ser He Glu Asn Arg Val Arg Gly Asn Leu Glu Asn Leu Phe Leu Gin 
290 295 300 



Cys Pro Lys Pro Thr Leu Gin Gin lie Ser His lie Ala Gin Gin Leu 
305 310 315 320 



Gly Leu Glu Lys Asp Val Val Arg Val Trp Phe Cys Asn Arg Arg Gin 
325 330 335 



Lys Gly Lys Arg Ser Ser Ser Asp Tyr Ala Gin Arg Glu Asp Phe Glu 
340 345 350 



Ala Ala Gly Ser Pro Phe Ser Gly Gly Pro Val Ser Phe Pro Leu Ala 
355 360 365 



Pro Gly Pro His Phe Gly Thr Pro Gly Tyr Gly Ser Pro His Phe Thr 
370 375 380 



Ala Leu Tyr Ser Ser Val Pro Phe Pro Glu Gly Glu Ala Phe Pro Pro 
385 390 395 400 



Val Ser Val Thr Thr Leu Gly Ser Pro Met His Ser Asn 
405 410 



<210> 114 

<211> 360 

<212> PRT 

<213> Homo Sapiens 

<400> 114 

Met Ala Gly His Leu Ala Ser Asp Phe Ala Phe Ser Pro Pro Pro Gly 
1 5 10 15 



Gly Gly Gly Asp Gly Pro Gly Gly Pro Glu Pro Gly Trp Val Asp Pro 
20 25 30 



Arg Thr Trp Leu Ser Phe Gin Gly Pro Pro Gly Gly Pro Gly lie Gly 
35 40 45 



Pro Gly Val Gly Pro Gly Ser Glu Val Trp Gly lie Pro Pro Cys Pro 
50 55 60 
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Pro Pro Tyr Glu Phe Cys Gly Gly Met Ala Tyr Cys Gly .Pro Gin Val 
65 70 75 80 



Gly Val Gly Leu Val Pro Gin Gly Gly Leu Glu Thr Ser Gin Pro Glu 
85 90 95 



Gly Glu Ala Gly Val Gly Val Glu Ser Asn ser Asp Gly Ala Ser Pro 
100 105 110 



Glu Pro Cys Thr Val Thr Pro Gly Ala Val Lys Leu Glu Lys Glu Lys 
115 120 125 



Leu Glu Gin Asn Pro Glu Glu Ser Gin Asp lie Lys Ala Leu Gin Lys 
130 135 140 



Glu Leu Glu Gin Phe Ala Lys Leu Leu Lys Gin Lys Arg He Thr Leu 
145 150 155 160 



Gly Tyr Thr Gin Ala Asp Val Gly Leu Thr Leu Gly Val Leu Phe Gly 
165 170 175 



Lys Val Phe Ser Gin Thr Thr lie Cys Arg Phe Glu Ala Leu Gin Leu 
180 185 190 



Ser Phe Lys Asn Met Cys Lys Leu Arg Pro Leu Leu Gin Lys Trp Val 
195 200 205 



Glu Glu Ala Asp Asn Asn Glu Asn Leu Gin Glu lie Cys Lys Ala Glu 
210 215 220 



Thr Leu Val Gin Ala Arg Lys Arg Lys Arg Thr Ser He Glu Asn Arg 
225 230 235 240 



Val Arg Gly Asn Leu Glu Asn Leu Phe Leu Gin Cys Pro Lys Pro Thr 
245 250 255 



Leu Gin Gin lie Ser His He Ala Gin Gin Leu Gly Leu Glu Lys Asp 
260 265 270 



Val Val Arg Val Trp Phe Cys Asn Arg Arg Gin Lys Gly Lys Arg Ser 
275 280 285 



Ser Ser Asp Tyr Ala Gin Arg Glu Asp Phe Glu Ala Ala Gly Ser Pro 
290 295 300 
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Phe Ser Gly Gly Pro Val Ser Phe Pro Leu Ala Pro Gly Pro His Phe 
305 310 315 320 

Gly Thr Pro Gly Tyr Gly Ser Pro His Phe Thr Ala Leu Tyr Ser Ser 
325 ~ 330 335 



Val Pro Phe Pro Glu Gly Glu Ala Phe Pro Pro Val Ser Val Thr Thr 
340 345 350 



Leu Gly Ser Pro Met His Ser Asn 

360 





355 


<210> 


115 


*211> 


529 


<212> 


PRT 


<213> 


Homo Sapiens 


<400> 


115 



Met Ser Val Lys Trp Thr Ser Val lie Leu Leu lie Gin Leu Ser Phe 
15 10 15 



Cys Phe Ser Ser Gly Asn Cys Gly Lys Val Leu Val Trp Ala Ala Glu 
20 25 30 



Tyr Ser His Trp Met Asn lie Lys Thr lie Leu Asp Glu Leu lie Gin 
35 40 45 



Arg Gly His Glu Val* Thr Val Leu Ala Ser Ser Ala Ser lie Leu Phe 
50 55 60 



Asp Pro Asn Asn Ser Ser Ala Leu Lys lie Glu lie Tyr Pro Thr Ser 
65 70 75 80 



Leu Thr Lys Thr Glu Leu Glu Asn Phe lie Met Gin Gin lie Lys Arg 
85 90 95 



Trp Ser Asp Leu Pro Lys Asp Thr Phe Trp Leu Tyr Phe Ser Gin Val 
100 105 110 



Gin Glu lie Met Ser He Phe Gly Asp He Thr Arg Lys Phe Cys Lys 
115 120 125 



Asp Val Val Ser Asn Lys Lys Phe Met Lys Lys Val Gin Glu Ser Arg 
130 135 140 



Phe Asp Val He Phe Ala Asp Ala He Phe Pro Cys Ser Glu fceu Leu 
145 150 155 160 
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Ala Glu Leu Phe Asn lie Pro Phe Val Tyr Ser Leu Ser Phe Ser Pro 
165 170 V?5 



Gly Tyr Thr Phe Glu Lys His Ser Gly Gly Phe He Phe Pro Pro Ser 
180 185 190 



Tyr Val Pro Val Val Met Ser Glu Leu Thr Asp Gin Met Thr Phe Met 
195 200 205 

Glu Arg Val Lys Asn Met He Tyr Val Leu Tyr Phe Asp Phe Trp Phe 
210 215 220 

Glu lie Phe Asp Met Lys Lys Trp Asp Gin Phe Tyr Ser Glu Val Leu 
225 230 235 240 



Gly Arg Pro Thr Thr Leu Ser Glu Thr Met Gly Lys Ala Asp Val Trp 
245 250 255 



Leu He Arg Asn Ser Trp Asn Phe Gin Phe Pro His Pro Leu Leu Pro 
260 265 270 



Asn Val Asp Phe Val Gly Gly Leu His Cys hys Pro Ala Lys Pro Leu 
275 280 285 



Pro Lys Glu Met Glu Asp Phe Val Gin Ser Ser Gly Glu Asn Gly Val 
290 295 300 

Val Val Phe Ser Leu Gly Ser Met Val Ser Asn Met Thr Glu Glu Arg 
305 310 315 320 

Ala Asn Val He Ala Ser Ala Leu Ala Gin He Pro Gin Lys Val Leu 
325 330 335 



Trp Arg Phe Asp Gly Asn Lys Pro Asp Thr Leu Gly Leu Asn Thr Arg 
340 345 350 



Leu Tyr Lys Trp He Pro Gin Asn Asp Leu Leu Gly His Pro Lys Thr 
355 360 365 



Arg Ala Phe He Thr His Gly Gly Ala Asn Gly He Tyr Glu Ala He 
^70 375 380 



Tyr His Gly He Pro Met Val Gly He Pro Leu Phe Ala Asp Gin Pro 
385 390 395 400 
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Asp Asn He Ala His Met Lys Ala Arg Gly Ala Ala Val Arg Val Asp 
405 410 415 



Phe Asn Thr Met Ser Ser Thr Asp Leu Leu Asn Ala Leu Lys Arg Val 
420 425 430 ■ 



lie Asn Asp Pro Ser Tyr Lys Glu Asn Val Met Lys Leu Ser Arg He 
435 440 445 



Gin His Asp Gin Pro Val Lys Pro Leu Asp Arg Ala Val Phe Trp He 
450 455 460 



Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala Ala 
465 470 475 480 



His Asp Leu Thr Trp Phe Gin Tyr His Ser Leu Asp Val He Gly Phe 
485 490 495 



Leu Leu Val Cys Val Ala Thr Val He Phe He Val Thr Lys Cys Cys 
500 505 510 



Leu Phe Cys Phe Trp Lys Phe Ala Arg Lys Ala Lys Lys Gly Lys Asn 
515 520 525 



Asp 



<210> 116 

<211> 2S72 

<212> PRT 

<213> Homo Sapiens 

<400> 116 

Met Leu Gin Cys Thr Pro Ala Asn Met Val Glu Val His Lys Asp Lys 
15 10 15 



Glu Ser Ser Lys Gly His Thr Arg His Lys Val Glu Glu Ala Leu He 
20 25 30 



Asn Glu Glu Ala He Leu Asn Leu Met Glu Asn Ser Gin Thr Phe Gin 
35 40 45 



Pro Leu Thr Gin Arg Leu Ser Glu Ser Pro Val Phe Met Asp Ser Ser 
50 55 60 



Pro Asp Glu Ala 



Leu Val His Leu Leu Ala Gly Leu Glu Ser Asp Gly 
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Tyr Arg Gly Glu Arg Asn Arg Met Pro Ser Pro Cys Arg Ser Phe 



85 



90 



Gly 



95 



Asn Asn Lys Tyr Pro Gin Asn Ser Asp Asp Glu Glu Asn Glu Pro Gin 
100 105 no 

He Glu Lys Glu Glu Wet Glu Leu Ser Leu Val Met Ser Gin Arg Tro 
H5 120 125 



Asp Ser Asn He Glu Glu His Cys Ala Lys Lys Arg Ser Leu Cys Arq 
130 135 " y 



140 



Asn Thr His Arg Ser Ser Thr Glu Asp Asp Asp Ser Ser Ser Gly Glu 



150 



155 



160 



Glu Met Glu Trp Ser Asp Asn Ser Leu Leu Leu Ala Ser Leu Ser He 
16 5 170 175 

Pro Gin Leu Asp Gly Thr Ala Asp Glu Asn Ser Asp Asn Pro Leu Asn 
18° 185 190 

Asn Glu Asn Ser Arg Thr His Ser Ser Val He Ala Thr Ser Lys Leu 
195 200 205 

Ser Val Lys Pro Ser He Phe His Lys Asp Ala Ala Thr Leu Glu Pro 
210 215 220 



Ser Ser Ser Ala Lys lie Thr Phe Gin Cys Lys His Thr Ser Ala Leu 
225 230 235 240 

Ser Ser His Val Leu Asn Lys Glu Asp Leu He Glu Asp Leu Ser Gin 
245 250 255 

Thr Asn Lys Asn Thr Glu Lys Gly Leu Asp Asn Ser Val Thr Ser Phe 
2 6° 265 270 

Thr Asn Glu Ser Thr Tyr Ser Met Lys Tyr Pro Gly Ser Leu Ser Ser 
275 280 285 



Thr Val His Ser Glu Asn Ser His Lys Glu Asn Ser Lys Lys Glu He 
250 295 300 



Leu Pro Val Ser Ser Cys Glu Ser Ser 
305 310 



He Phe Asp Tyr Glu Glu Asp 
315 320 
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He Pro Ser Val Thr Arg Gin Val Pro Ser Arg Lys Tyr Thr Asn He 
325 330 335 



Arg Lys He Glu Lys Asp Ser Pro Phe He His Met His Arg His Pro 
340 345 350 



Asn Glu Asn Thr Leu Gly Lys Asn Ser Phe Asn Phe Ser Asp Leu Asn 
355 360 365 



His Ser Lys Asn Lys Val Ser Ser Glu Gly Asn Glu Lys Gly Asn Ser 
370 375 380 



Thr Ala Leu Ser Ser Leu Phe Pro Ser Ser Phe Thr Glu Asn Cys Glu 
385 390 395 400 



Leu Leu Ser Cys Ser Gly Glu Asn Arg Thr Met Val His Ser Leu Asn 
405 410 415 



Ser Thr Ala Asp Glu Ser Gly Leu Asn Lys Leu Lys He Arg Tyr Glu 
420 425 430 



Glu Phe Gin Glu His Lys Thr Glu Lys Pro Ser Leu Ser Gin Gin Ala 
435 440 445 



Ala His Tyr Met Phe Phe Pro Ser Val Val Leu Ser Asn Cys Leu Thr 
450 455 460 



Arg Pro Gin Lys Leu Ser Pro Val Thr Tyr Lys Leu Gin Pro Gly Asn 
455 470 475 480 



Lys Pro Ser Arg Leu Lys Leu Asn Lys Arg Lys Leu Ala Gly His Gin 
485 490 • 495 



Glu Thr Ser Thr Lys Ser Ser Glu Thr Gly Ser Thr Lys Asp Asn Phe 
500 505 510 



He Gin Asn Asn Pro Cys Asn Ser Asn Pro Glu Lys Asp Asn Ala Leu 
515 520 525 



Ala Ser Asp Leu Thr Lys Thr Thr Arg Gly Ala Phe Glu Asn Lys Thr 
530 535 540 



Pro Thr Asp Gly Phe He Asp Cys His Phe Gly Asp Gly Thr Leu Glu 
545 550 555 560 
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Thr Glu Gin Ser Phe Gly Leu Tyr Gly Asn Lys Tyr Thr Leu Arg Ala 
565 570 575 



Lys Arg Lys Val Asn Tyr Glu Thr Glu Asp Ser Glu Ser Ser Phe Val 
580 585 590 



Thr His Asn Ser Lys lie Ser Leu Pro His Pro Met Glu lie Gly Glu 
595 600 605 



Ser Leu Asp Gly Thr Leu Lys Ser Arg Lys Arg Arg Lys Met Ser Lys 
610 615 620 



Lys Leu Pro Pro Val He lie Lys Tyr He He He Asn Arg Phe Arg 
625 630 635 640 



Gly Arg Lys Asn Met Leu Val Lys Leu Gly Lys lie Asp Ser Lys Glu 
645 650 655 



Lys Gin Val Xle Leu Thr Glu Glu Lys Met Glu Leu Tyr Lys Lys Leu 
660 665 670 



Ala Pro Leu Lys Asp Phe Trp Pro Lys Val Pro Asp Ser Pro Ala Thr 
675 680 . 685 



Lys Tyr Pro He Tyr Pro Leu Thr Pro Lys Lys Ser His Arg Arg Lys 
690 695 700 



Ser Lys His Lys Ser Ala Lys Lys Lys Thr Gly Lys Gin Gin Arg Thr 
705 710 715 720 



Asn Asn Glu Asn He Lys Arg Thr Leu Ser Phe Arg Lys Lys Arg Ser 
725 730 735 



His Ala He Leu Ser Pro Pro Ser Pro Ser Tyr Asn Ala Glu Thr Glu 
740 745 750 



Asp Cys Asp Leu Asn Tyr Ser Asp Val Met Ser Lys Leu Gly Phe Leu 
755 760 765 



Ser Glu Arg Ser Thr Ser Pro He Asn Ser Ser Pro Pro Arg Cys Trp 
770 775 780 



Ser Pro Thr Asp Pro Arg Ala Glu Glu He Met Ala Ala Ala Glu Lys 
785 790 795 800 
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Glu Ala Met Leu Phe Lys Gly Pro As n val Tyr Lys Lys Thr Val Asn 
805 8 io 815 



Ser Arg lie Gly Lys Thr Ser Arg Ala Arg Ala Gin lie Lys Lys Ser 
820 825 820 



Lys Ala Lys Leu Ala Asn Pro Ser lie Val Thr Lys Lys Arg Asn Lys 
835 840 845 



Arg Asn Gin Thr Asn Lys Leu Val Asp Asp Gly Lys Lys Lys Pro Arg 
850 855 * 860 



Ala Lys Gin Lys Thr Asn Glu Lys Gly Thr Ser Arg Lys His Thr Thr 
865 870 875 880 



Leu Lys Asp Glu Lys lie Lys Ser Gin Ser Gly Ala Glu Val Lys Phe 
885 890 895 



Val Leu Lys His Gin Asn Val Ser Glu Phe Ala Ser Ser Ser Gly Gly 
900 90S 910 



Ser Gin Leu Leu Phe Lys Gin Lys Asp Met Pro Leu Met Gly Ser Ala 
915 920 925 



Val Asp His Pro Leu Ser Ala Ser Leu Pro Thr Gly lie Asn Ala Gin 
930 935 940 



Gin Lys Leu Ser Gly Cys Phe Ser Ser Phe Leu Glu Ser Lys Lys Ser 
9 *5 950 955 960 



Val Asp Leu Gin Thr Phe Pro Ser Ser Arg Asp Asp Leu His Pro Ser 
965 970 " 975 



Val Val Cys Asn Ser lie Gly Pro Gly Val ser Lys lie Asn Val Gin 
980 985 990 



Arg Pro His Asn Gin Ser Ala Met Phe Thr Leu Lys Glu Ser Thr Leu 
935 1000 1005 



He Gin Lys Asn He Phe Asp Leu Ser Asn His Leu Ser Gin Val 
1010 1015 1020 



Ala Gin Asn Thr Gin He Ser Ser Gly Met Ser Ser Lys He Glu 
1025 1030 1035 



Asp Asn Ala Asn Asn He Gin Arg Asn Tyr Leu Ser Ser He Gly 
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Lys Leu Ser Glu Tyr Arg Asn Ser Leu Glu Ser Lys Leu Asp Gin 
1055 1060 1065 



Ala Tyr Thr Pro Asn Phe Leu His Cys Lys Asp Ser Gin Gin Gin 
1070 1075 1080 



He Val Cys He Ala Glu Gin Ser Lys His Ser Glu Thr Cys Ser 
1085 1090 1095 



Pro Gly Asn Thr Ala Ser Glu Glu Ser Gin Met Pro Asn Asn Cys 
1100 1105 1110 



Phe Val Thr Ser Leu Arg Ser Pro He Lys Gin He Ala Trp Glu 
" 1115 1120 1125 



Gin Lys Gin Arg Gly Phe He Leu Asp Met Ser Asn Phe Lys Pro 
1130 1135 1140 



Glu Arg Val Lys Pro Arg Ser Leu Ser Glu Ala He Ser Gin Thr 
1145 1150 1155 



Lys Ala Leu Ser Gin Cys Lys Asn Arg Asn Val Ser Thr Pro Ser 
1160 1165 1170 



Ala Phe Gly Glu Gly Gin Ser Gly Leu Ala Val Leu Lys Glu Leu 
1175 1180 1185 



Leu Gin Lys Arg Gin Gin Lys Ala Gin Asn Ala Asn Thr Thr Gin 
1190 1195 1200 



Asp Pro Leu Ser Asn Lys His Gin Pro Asn Lys Asn He Ser Gly 
1205 1210 1215 



Ser Leu Glu His Asn Lys Ala Asn Lys Arg Thr Arg Ser Val Thr 
1220 1225 1230 



Ser Pro Arg Lys Pro Arg Thr Pro Arg Ser Thr Lys Gin Lys Glu 
1235 1240 1245 



Lys He Pro Lys Leu Leu Lys Val Asp Ser Leu Asn Leu Gin Asn 
1250 1255 1260 



Ser Ser Gin Leu Asp Asn Ser Val Ser Asp Asp Ser Pro He Phe 
1265 1270 1275 
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Phe Ser Asp Pro Gly Phe Glu Ser Cys Tyr Ser Leu Glu Asp Ser 
1280 1285 1290 



Leu Ser Pro Glu His Asn Tyr Asn Phe Asp He Asn Thr He Gly 
1295 1300 * 1305 



Gin Thr Gly Phe Cys Ser Phe Tyr Ser Gly Ser Gin Phe Val Pro 
1310 1315 1320 



Ala Asp Gin Asn Leu Pro Gin Lys Phe Leu Ser Asp Ala Val Gin 
1325 1330 1335 



Asp Leu Phe Pro Gly Gin Ala He Glu Lys Asn Glu Phe Leu Ser 
1340 1345 1350 



His Asp Asn Gin Lys Cys Asp Glu Asp Lys His His Thr Thr Asp 
1355 1360 1365 



Sear Ala Ser Trp He Arg Ser Gly Thr Leu Ser Pro Glu He Phe 
1370 1375 1380 



Glu Lys Ser Thr He Asp Ser Asn Glu Asn Arg Arg His Asn Gin 
1385 1390 1395 



Trp Lys Asn Ser Phe His Pro Leu Thr Thr Arg Ser Asn Ser He 
1400 1405 1410 



Met Asp Ser Phe Cys Val Gin Gin Ala Glu Asp Cys Leu Ser Glu 
1415 1420 1425 



Lys Ser Arg Leu Asn Arg Ser Ser Val Ser Lys Glu Val Phe Leu 
1430 1435 1440 



Ser Leu Pro Gin Pro Asn Asn Ser Asp Trp He Gin Gly His Thr 
1445 1450 ' 1455 



Arg Lys Glu Met Gly Gin Ser Leu Asp Ser Ala Asn Thr Ser Phe 
1460 1465 1470 



Thr Ala He Leu Ser Ser Pro Asp Gly Glu Leu Val Asp Val Ala 
1475 1480 1485 



Cys Glu Asp Leu Glu Leu Tyr Val Ser Arg Asn Asn Asp Met Leu 
1490 1495 1500 
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Thr Pro Thr Pro Asp Ser Ser Pro Arg Ser Thr Ser Ser Pro Ser 
1505 1510 1515 



Gin Ser Lys Asn Gly Ser Phe Thr Pro Arg Thr Ala Asn lie Leu 
1520 1525 1530 



Lys Pro Leu Met Ser Pro Pro Ser Arg Glu Glu lie Met Ala Thr 
1535 1540 " 1545 



Leu Leu Asp His Asp Leu Ser Glu Thr lie Tyr Gin Glu Pro Phe 
1550 1555 1560 



Cys Ser Asn Pro Ser Asp Val Pro Glu Lys Pro Arg Glu lie Gly 
1565 1570 1575 



Gly Arg Leu Leu Met Val Glu Thr Arg Leu Ala Asn Asp Leu Ala 
1580 1585 1590 



Glu Phe Glu Gly Asp Phe Ser Leu Glu Gly Leu Arg Leu Trp Lys 
1595 1600 1605 



Thr Ala Phe Ser Ala Met Thr Gin Asn Pro Arg Pro Gly Ser Pro 
1610 1615 1620 



Leu Arg Ser Gly Gin Gly Val Val Asn Lys Gly Ser Ser Asn Ser 
1625 1630 1635 



Pro Lys Met Val Glu Asp Lys Lys lie Val lie Met Pro Cys Lys 
1640 1645 1650 



Cys Ala Pro Ser Arg Gin Leu Val Gin Val Trp Leu Gin Ala Lys 
1655 1660 1665 



Glu Glu Tyr Glu Arg Ser Lys Lys Leu Pro Lys Thr Lys Pro Thr 
1670 1675 1680 



Gly Val Val Lys Ser Ala Glu Asn Phe Ser Ser Ser Val Asn Pro 
1685 1690 1695 



Asp Asp Lys Pro Val Val Pro Pro Lys Met Asp Val Ser Pro Cys 
1700 1705 1710 



He Leu Pro Thr Thr Ala His Thr Lys Glu Asp Val Asp Asn Ser 
1715 1720 1725 
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Gin lie Ala Leu Gin Ala Pro Thr Thr Gly Cys Ser Gin Thr Ala 
1730 1735 • X740 

Ser Glu Ser Gin Met Leu Pro Pro Val Ala Ser Ala Ser Asp Pro 
1745 1750 1755 

Glu Lys Asp Glu Asp Asp Asp Asp Asn Tyr Tyr He Ser Tyr Ser 
1760 1765 1770 

Ser Pro Asp Ser Pro Val He Pro Pro Trp Gin Gin Pro He Ser 
1775 1780 1785 

Pro Asp Ser Lys Ala Leu Asn Gly Asp Asp Arg Pro Ser Ser Pro 
1790 1795 1800 

Val Glu Glu Leu Pro Ser Leu Ala Phe Glu Asn Phe Leu Lys Pro 
1805 1810 1815 

lie Lys Asp Gly He Gin Lys Ser Pro Cys Ser Glu Pro Gin Glu 
1820 1825 1830 

Pro Leu Val He Ser Pro He Asn Thr Arg Ala Arg Thr Gly Lys 
1835 1840 1845 

Cys Glu Ser Leu Cys Phe His Ser Thr Pro He He Gin Arg Lys 
1850 1855 I860 

Leu Leu Glu Arg Leu Pro Glu Ala Pro Gly Leu Ser Pro Leu Ser 
1S65 1870 1875 

Thr Glu Pro Lys Thr Gin Lys Leu Ser Asn Lys Lys Gly Ser Asn 
1880 1885 1890 

Thr Asp Thr Leu Arg Arg Val Leu Leu Thr Gin Ala Lys Asn Gin 
• 1895 1900 1905 

Phe Ala Ala Val Asn Thr Pro Gin Lys Glu Thr Ser Gin He Asp 
1910 1915 1920 

Gly Pro Ser Leu Asn Asn Thr Tyr Gly Phe Lys Val Ser He Gin 
1925 1930 1935 

Asn Leu Gin Glu Ala Lys Ala Leu His Glu He Gin Asn Leu Thr 
1940 1945 1950 



Leu He Ser Val Glu Leu His Ala Arg Thr Arg Arg Asp Leu Glu 
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19S 5 I960 1965 

Pro Asp Pro Glu Phe Asp Pro He Cys Ala Leu Phe Tyr Cys He 
19 ? 0 1975 1980 

Ser Ser Asp Thr Pro Leu Pro Asp Thr Glu Lys Thr Glu Leu Thr 
1985 1990 1995 

Gly Val He Val Xle Asp Lys Asp Lys Thr Val Phe Ser Gin Ast> 
2000 2005 2010 

He Arg Tyr Gin Thr Pro Leu Leu lie Arg Ser Gly Xle Thr Gly 
2015 2020 2025 

Leu Glu Val Thr Tyr Ala Ala Asp Glu Lys Ala Leu Phe His Glu 
2030 2035 2040 

He Ala Asn He He Lys Arg Tyr Asp Pro Asp He Leu Leu Gly 
2045 2050 2055 

Tyr Glu He Gin Met His Ser Trp Gly Tyr Leu Leu Gin Arg Ala 
2 °60 2065 2070 

Ala Ala Leu Ser He Asp Leu Cys Arg Met He Ser Arg Val Pro 
2075 2080 * 2085 

Asp Asp Lys He Glu Asn Arg Phe Ala Ala Glu Arg Asp Glu Tyr 
2090 2095 2100 

Gly Ser .Tyr Thr Met Ser Glu He Asn He Val Gly Arg He Thr 
2105 2110 2115 

Leu Asn Leu Trp Arg He Met Arg Asn Glu Val Ala Leu Thr Asn 
2120 2125 2130 

Tyr Thr Phe Glu Asn Val Ser Phe His Val Leu His Gin Arg Phe 
2135 2140 2145 

Pro Leu Phe Thr Phe Arg Val Leu Ser Asp Trp Phe Asp Asn Lys 
2150 2155 2160 

Thr Asp Leu Tyr Arg Tyr Cys Ser He Thr Leu Lys Lys Arg Gin 
2165 2170 2175 



Gin Thr ser Ala Leu Tyr His Trp Gin Val Leu Gly Pro He 
2180 2185 2190 
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Phe Trp Val He Phe Thr Ser Tyr Asn lie Lys lie Leu Phe Met 
2195 2200 2205 



Asp Leu Leu Arg Val Leu Leu Phe val Phe Leu Arg Arg Trp Lys 
2210 2215 2220 



Met Val Asp His Tyr Val Ser Arg Val Arg Gly Asn Leu Gin Met 
2225 2230 2235 



Leu Glu Gin Leu Asp Leu He Gly Lys Thr Ser Glu Met Ala Arg 
2240 2245 2250 



Leu Phe Gly lie Gin Phe Leu His Val Leu Thr Arg Gly Ser Gin 
2255 2260 2 265 



Tyr Arg Val Glu Ser Met Met Leu Arg He Ala Lys Pro Met Asn 
2270 2275 2280 



Tyr He Pro Val Thr Pro Ser Val Gin Gin Arg Ser Gin Met Arg 
2285 2290 2295 



Ala Pro Gin Cys Val Pro Leu He Met Glu Pro Glu Ser Arg Phe 
2300 2305 2310 



Tyr Ser Asn Ser Val Leu Val Leu Asp Phe Gin Ser Leu Tyr Pro 
2315 2320 2325 



Ser He Val lie Ala Tyr Asn Tyr Cys Phe Ser Thr Cys Leu Gly 
2330 2335 2340 



His Val Glu Asn Leu Gly Lys Tyr Asp Glu Phe Lys Phe Gly Cys 
2345 2350 2355 



Thr Ser Leu Arg Val Pro Pro Asp Leu Leu Tyr Gin Val Arg His 
2360 2365 2370 



Asp He Thr Val Ser Pro Asn Gly Val Ala Phe Val Lys Pro Ser 
2375 2380 2385 



Val Arg Lys Gly Val Leu Pro Arg Met Leu Glu Glu He Leu Lys 
2390 2395 2400 



Thr Arg Phe Met Val Lys Gin Ser Met Lys Ala Tyr Lys Gin Asp 
2405 2410 2415 
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Arg Ala Leu Ser Arg Met Leu Asp Ala Arg Gin Leu Gly Leu Lys 
2420 2425 ~ 2430 



Leu lie Ala Asn Val Thr Phe Gly Tyr Thr Ser Ala Asn Phe Ser 
2435 2440 2445 



Gly Arg Met Pro Cys lie Glu Val Gly Asp Ser lie Val His Lys 
2450 2455 2460 



Ala Arg Glu Thr Leu Glu Arg Ala lie Lys Leu Val Asn Asp Thr 
2465 2470 2475 



Lys Lys Trp Gly Ala Arg Val Val Tyr Gly Asp Thr Asp Ser Wet 
2480 2485 2490 



Phe Val Leu Leu Lys Gly Ala Thr Lys Glu Gin Ser Phe Lys He 
2495 2500 2505 



Gly Gin Glu He Ala Glu Ala Val Thr Ala Thr Asn Pro Lys Pro' 
2510 2515 2520 



Val Lys Leu Lys Phe Glu Lys Val Tyr Leu Pro Cys Val Leu Gin 
2525 2530 2535 



Thr Lys Lys Arg Tyr Val Gly Tyr Met Tyr Glu Thr Leu Asp Gin 
2540 2545 2550 



Lys Asp Pro Val Phe Asp Ala Lys Gly He Glu Thr Val Arg Arg 
2555 2560 2565 



Asp Ser Cys Pro Ala Val Ser Lys He Leu Glu Arg Ser Leu Lys 
2570 2575 2580 



Leu Leu Phe Glu Thr Arg Asp He Ser Leu He Lys Gin Tyr Val 
2585 2590 2595 



Gin Arg Gin Cys Met Lys Leu Leu Glu Gly Lys Ala Ser He Gin 
2600 2605 2S10 



Asp Phe He Phe Ala Lys Glu Tyr Arg Gly Ser Phe Ser Tyr Lys 
2615 2620 2625 



Pro Gly Ala Cys Val Pro Ala Leu Glu Leu Thr Ser Phe Phe He 
2630 2635 2640 
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Val Leu Leu Leu Phe Asn Ser Asp Leu lie Cys Glu Lys Asp Gly 
2645 2650 2655 

Phe His Asn Ser lie Trp Val Tip Phe Phe Ser Leu Asn Ser Asn 
2660 2665 2670 

Arg Lys Met Leu Thr Tyr Asp Arg Arg Ser Glu Pro Gin Val Gly 
2675 2680 2685 

Glu Arg Val Pro Tyr Val lie Xle Tyr Gly Thr Pro Gly Val Pro 
2690 2695 ~ ~ 2700 

Leu He Gin Leu Val Arg Arg Pro Val Glu Val Leu Gin Asp Pro 
2705 2710 2715 

Thr Leu Arg Leu Asn Ala Thr Tyr Tyr lie Thr Lys Gin He Leu 
2720 2725 2730 

Pro Pro Leu Ala Arg He Phe Ser Leu He Gly He Asp Val Phe 
2735 2740 2745 

Ser Trp Tyr His Glu Leu Pro Arg He His Lys Ala Thr Ser Ser 
2750 2755 2760 

Ser Arg Ser Glu Pro Glu Gly Arg Lys Gly Thr lie Ser Gin Tyr 
2765 2770 * 2775 

Phe Thr Thr Leu His Cys Pro Val Cys Asp Asp Leu Thr Gin His 
2780 2785 2790 

Gly He Cys Ser Lys Cys Arg Ser Gin Pro Gin His Val Ala Val 
2795 2800 2805 

He Leu Asn Gin Glu He Arg Glu Leu Glu Arg Gin Gin Glu Gin 
2810 2815 2820 

Leu Val Lys He Cys Lys Asn Cys Thr Gly Cys Phe Asp Arg His 
2825 2830 . 2835 

He Pro Cys Val Ser Leu Asn Cys Pro Val Leu Phe Lys Leu Ser 
2840 2845 2850 

Arg Val Asn Arg Glu Leu Ser Lys Ala Pro Tyr Leu Arg Gin Leu 
285 5 2860 2865 



Leu Asp Gin Phe 
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Although claims 22 and 24 are directed to a method of treatment of the 
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effects of the compound/composition. 

2. Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
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3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
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This International Searching Authority found multiple (groups of) 
. inventions in this international application, as follows: 

Invention 1: Claims 1-27 (partially) 

Method for detecting a pathological cell in a biological 
sample by detection of a nucleic acid with SEQ ID NO 1 or a 
protein with SEQ ID NO 59. An isolated nucleic acid with SEQ 
ID NO 1. Vectors or host cells containing the isolated 
nucleic acid with SEQ ID N0:1. An isolated nucleic acid 
encoding the polypeptide with SEQ ID N0:59. A polypeptide 
encoded by the nucleic with SEQ ID N0:1. An antibody that 
binds specifically the polypeptide with SEQ ID N0:1. A 
method for targetting a pathological .cell in a, patient,, 
employing the said antibody. A method for detecting a 
pathological cell in a patient employing said antibody. A 
method for identifying compounds modulating 
pathology-associated polypeptides by contacting the compound 
with a polypeptide encoded by the polynucleotide with SEQ ID 
N0:1. Screening assay involving the comparison of 
expression levels of SEQ ID N0:1 from a pathological cell 
and a cell that does not show the pathology. 



Invention 2: Claims 1-27 (partially) 

Method for detecting a pathological cell in a biological 
sample by detection of a nucleic acid with SEQ ID NO 2 or a 
protein with SEQ ID NO 50. An isolated nucleic acid with SEQ 
ID NO 2. Vectors or host cells containing the isolated 
nucleic acid with SEQ ID N0:2. An isolated nucleic acid 
encoding the polypeptide with SEQ ID NO: 60. A polypeptide 
encoded by the nucleic with SEQ ID N0:2. An antibody that 
binds specifically the polypeptide with SEQ ID N0:2. A 
method for targetting a pathological cell in a patient 
employing the said antibody. A method for detecting a 
pathological cell in a patient employing said antibody. A 
method for identifying compounds modulating 
pathology-associated polypeptides by contacting the compound 
with a polypeptide encoded by the polynucleotide with SEQ ID 
N0:2. Screening assay involving the comparison of 
expression levels of SEQ ID NO: 2 from a pathological cell 
and a cell that does not show the pathology. 
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idem for each of the polynucleotide - polypeptide pairs with 
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